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METHODS AND REAGENTS FOR DISCOVERING AND USING 
MAMMALIAN MELANOCORTIN RECEPTOR AGONISTS AND 
ANTAGONISTS TO MODULATE FEEDING BEHAVIOR IN ANIMALS 

BACKGROUND OF THE INVENTION 
i. FieW of the Invention 

The present invention relates to the cloning, expression and functional 
characterization of mammalian melanocortin receptor genes. The invention provides 
nucleic acid encoding mammalian melanocortin receptors, recombinant expression 
constructs comprising said nucleic acid, and mammalian cells into which said 
recombinant expression constructs have been introduced, and that express functional 
mammalian melanocortin receptors. The invention also provides a panel of such 
transformed mammalian cells expressing melanocortin receptors for screening 
compounds for receptor agonist and antagonist activity. The invention provides methods 
for using such panels of melanocortin receptor-expressing mammalian cells to 
specifically detect and identify agonists and antagonists for each melanocortin receptor, 
as well as patterns of agonist and antagonist activity of said compounds for the class of 
melanocortin receptors. Such screening methods provide a means for identifying 
compounds with patterns of melanocortin agonist and antagonist activity which is 
associated with the capacity to influence or modify physiological function and behavior, 
particularly metabolism and feeding behavior. 

2. Background of th? Ipventiop 

The proopiomelanocortin (POMC) gene product is processed to produce a large 
number of biologically active peptides. Two of these peptides, a-melanocyte stimulating 
hormone (aMSH), and adrenocorticotropic hormone (ACTH) have well understood roles 
in control of melanocyte and adrenocortical function, respectively. Both of these 
hormones are also found in a variety of forms with unknown functions, for example, y- 
melanocyte stimulating hormone (yMSH), which has little or no ability to stimulate 
pigmentation (Ling et aL y 1 979, Life ScL 25: 1 773-1 780; Slominski et aL y 1 992, Life ScL 
1Q: 1 103-1 108). A melanocortin receptor gene specific for each of the aMSH, ACTH 
and yMSH hormones has been discovered by some of the present inventors {see U.S. 
Patent Nos. 5,280,112, .5,532,347 and U.S. Application Serial No. 08/044,812, 
incorporated by reference herein). In addition, two other melanocortin receptor genes 
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have been discovered by some of the present inventors (see Lu et al, 1 994, Nature 371 : 
799-802; Mountjoy et al % 1994, Molec. Endocrinol &: 1298-1308) and others (see 
Gantze/p/., 1993,J. Biol Chem. 2££: 15174-15179 and Labbe et al, 1994, Biochem. 
21: 4543-4549). 

5 Along with the well-recognized activities of aMSH in melanocytes and ACTH 

in adrenal and pituitary glands, the melanocortin peptides also have a diverse array of 
biological activities in other tissues, including the brain and immune system, and bind 
to specific receptors in these tissues with a distinct pharmacology (see, Hanneman et al y 
in Peptide Hormone as Prohormones, G. Martinez, ed. (Ellis Horwood Ltd.: Chichester, 
10 UK) pp. 53-82; DeWied & Jolles, 1982, Physiol. Rev. £2: 976-1059 for reviews). A 
complete understanding of these peptides and their diverse biological activities requires 
the isolation and characterization of their corresponding receptors. Some biochemical 
studies have been reported in the prior art. 

Shimuze, 1985, Yale J. Biol. Med. 58: 561-570 discusses the physiology of 
15 melanocyte stimulating hormone. 

Tatro & Reichlin, 1987, Endocrinology 121: 1900-1907 disclose that MSH 
receptors are widely distributed in rodent tissues. 

Solae/a/., 1989, J. Biol. Chem.264> 14277-14280 disclose the molecular weight 
characterization of mouse and human MSH receptors linked to radioactively and 
20 photoaffinity labeled MSH analogues. 

Siegrist et al., 199 1 , J. Receptor Res. 11: 323-33 1 disclose the quantification of 
receptors on mouse melanoma tissue by receptor autoradiography. 

Cone & Mountjoy, U.S. Patent No. 5,532,347 disclose the isolation of human 
and mouse ct-MSH receptor genes and uses thereof (incorporated herein by reference). 
25 Cone & Mountjoy, U.S. Patent No. 5,280,1 12 disclose the isolation of human 

and bovine ACTH receptor genes and uses thereof (incorporated herein by reference). 

Mountjoy et aL, 1992, Science 251: 1248-1251 disclose the isolation of cDNAs 
encoding mammalian ACTH and MSH receptor proteins. 

POMC neurons are present in only two regions of the brain, the arcuate nucleus 
30 of the hypothalamus, and the nucleus of the solitary tract of the brain stem. Neurons 
from both sites project to a number of hypothalamic nuclei known to be important in 
feeding behavior, including the paraventricular nucleus, lateral hypothalamic area, and 
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ventromedial hypothalamic nucleus. While previous reports have claimed both 
stimulatory and inhibitory effects of a-MSH on feeding behavior (see Shimizu et al 9 
1989, Life ScL 45: 543-552; Tsujii et al y 1989, Brian Res. Bull 22: 165-169), 
knowledge of specific melanocortin receptors, their location within the central nervous 
system and the necessary pharmacological tools were not sufficiently developed at that 
time to allow the resolution of this issue. The present inventors have shown herein that 
a novel antagonist of the MC-3 and MC-4 melanocortin receptors can substantially 
increase food consumption in animals engaged in normal or fast-induced feeding 
behavior. This is consistent with expression of both MC-3 and MC-4 receptor mRNAs 
at these sites in in situ hybridization studies (Roselli-Rehfuss et al y 1993, Proc T Natl. 
Acad. ScL USA 9Q: 8856-8860; Mountjoy et al> 1994, Molec. Endocrinol &: 1298- 
1 308). Moreover, the regulation of arcuate nucleus POMC gene expression is consistent 
with an inhibitory role for POMC in feeding behavior. POMC mRNA levels arc 
decreased following a fast (Bergendahl et al 9 1992, Neuroendocrinol 913-920; 
Brady et al y 1990, Neuroendocrinol 52: 441-447), and a significant diurnal variation 
in POMC mRNA levels in the arcuate nucleus is seen in rat, with the nadir occurring 
around the onset of nighttime feeding at 1800 hrs (Steiner et al y 1994, FASEB J. £: 479- 
488). 

Thus, the experimental evidence indicates that POMC neurons play an important 
role in tonic inhibition of feeding behavior, wherein obesity results from a chronic 
disruption of this inhibitory tone by antagonism of central melanocortin receptors in at 
least one animal model (agouti). 

These results reveal for the first time a need in the art for a means for 
characterizing mammalian melanocortin receptor agonists and antagonists in vitro for 
the development of compounds that affect feeding behavior in animals. 

SUMMARY OF THE INVENTION 

The present invention provides a biological screening system for identifying and 
characterizing compounds that are agonists or antagonists of mammalian melanocortin 
receptors. The biological screening system of the invention comprises a panel of 
transformed mammalian cells comprising a recombinant expression construct encoding 
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a mammalian melanocortin receptor, and expressing said receptor thereby. The 
invention provides such a panel of transformed mammalian cells wherein the panel 
comprises.cells expressing each type of mammalian melanocortin receptor. Thus, the 
invention also provides nucleic acid encoding mammalian melanocortin receptors, 
5 recombinant expression constructs comprising said nucleic acid, and mammalian cells 

into which said recombinant expression constructs have been introduced, and that 
express functional mammalian melanocortin receptors. Methods for using such panels 
of melanocortin receptor-expressing mammalian cells to specifically detect and identify 
agonists and antagonists for each melanocortin receptor, as well as patterns of agonist 
10 and antagonist activity of said compounds for the class of melanocortin receptors, are 

also provided. Such screening methods provide a means for identifying compounds with 
patterns of melanocortin agonist and antagonist activity which is associated with the 
capacity to influence or modify metabolism and behavior in an animal, particularly 
feeding behavior. 

15 Thus, the invention provides in a first aspect a biological screening panel for 

determining the melanocortin receptor agonist/antagonist profile of a test compound. 
The panel comprises a first mammalian cell comprising a recombinant expression 
construct encoding a mammalian melanocortin receptor that is the a-MSH (MC-1) 
receptor. The panel also comprises a second mammalian cell comprising a recombinant 

20 expression construct encoding a mammalian melanocortin receptor that is the ACTH 
(MC-2) receptor. The panel also comprises a third mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocortin receptor that is 
the MC-3 receptor. The panel also comprises a fourth mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocortin receptor that is 

25 the MC-4 receptor. The panel also comprises a fifth mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocortin receptor that is 
the MC-5 receptor. As provided by the invention, each mammalian cell expresses the 
melanocortin receptor encoded by the recombinant expression construct comprising said 
cell. 

30 In preferred embodiments, the melanocortin receptors encoded by the 

recombinant expression constructs comprising the transformed mammalian cells 
comprising the panel are mouse MC-1 receptor (SEQ ID Nos.: 3 and 4); human MC-1 



-4- 



WO 98/10068 



PCTAJS97/15565 



receptor (SEQ ID Nos.: 5 and 6), human MC-2 (ACTH) receptor (SEQ ID Nos.: 7 and 
8), bovine MC-2 receptor (SEQ ID Nos.: 9 and 10), rat MC-3 receptor (SEQ ID Nos.: 
1 1 and 12), human MC-4 receptor (SEQ ID Nos.: 15 and 16) and mouse MC-5 receptor 
(SEQ ID Nos.: 17 and 18). 

In a second aspect, the invention provides a method for using the melanocortin 
receptor panel to identify and characterize test compounds as melanocortin receptor 
agonists and/or antagonists. In this embodiment, the method provided by the invention 
identifies a melanocortin receptor agonist, and comprises the steps of contacting each 
of the cells of the panel with a test compound to be characterized as an agonist of a 
mammalian melanocortin receptor and detecting binding of the test compound to each 
of the mammalian melanocortin receptors by assaying for a metabolite produced in the 
cells that bind the compound. In a preferred embodiment, the detected metabolite is 
cAMP. 

In a preferred embodiment of this method, each of the cells of the panel of 
mammalian cells expressing mammalian melanocortin receptors further comprises a 
recombinant expression construct encoding a cyclic AMP responsive element (CRE) 
transcription factor binding site that is operatively linked to a nucleic acid sequence 
encoding a protein capable of producing a detectable metabolite. In preferred 
embodiments, said protein is (i-galactosidasc, most preferably encoded by a nucleic acid 
comprising the recombinant expression construct identified as pCRE/p-galactosidase (as 
disclosed in Chen et al y 1994, Analyt. Biochem. 22&: 349-354). As provided by the 
invention, expression of the protein that produces the detectable metabolite is dependent 
on binding of the test compound to the melanocortin receptor expressed by each cell in 
the panel and the intracellular production of cAMP as a result. In this embodiment, 
cAMP production results in expression of a protein capable of producing a detectable 
metabolite, the protein most preferably being p-galactosidase. In preferred 
embodiments, the detectable metabolite absorbs light to produce a colored product. 
Thus, this embodiment of the invention provides a panel of melanocortin receptor- 
expressing cells whereby melanocortin hormone binding results in the production of a 
colored product in proportion to the extent of cAMP production in the cell as a result of 
hormone receptor binding. 
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In another embodiment of this aspect of the invention is provided a method for 
characterizing a compound as an antagonist of a mammalian melanocortin receptor. In 
this embodiment, the method comprises the steps of contacting each of the cells of the 
panel with an agonist of the mammalian melanocortin receptor in an amount sufficient 
5 to produce a detectable amount of a metabolite produced in the cells that bind the 

agonist, in the presence or absence of a test compound to be characterized as an 
antagonist of a mammalian melanocortin receptor, and detecting the amount of the 
metabolite produced in each cell in the panel in the presence of the test compound with 
the amount of the metabolite produced in each cell in the panel in the absence of the test 

10 compound. As provided by the assay, inhibition of the production of the detectable 
metabolite is used as an indication that the tested compound is a melanocortin receptor 
antagonist, which is further characterized quantitatively by the extent of said inhibition. 

In a preferred embodiment of this method, each of the cells of the panel of 
mammalian cells expressing mammalian melanocortin receptors further comprises a 

15 recombinant expression construct encoding a cyclic AMP responsive element (CRE) 

transcription factor binding site that is operatively linked to a nucleic acid sequence 
encoding a protein capable of producing a detectable metabolite. In preferred 
embodiments, said protein is p-galactosidase, most preferably encoded by a nucleic acid 
comprising the recombinant expression construct identified as pCRE/p-galactosidase. 

20 As provided by the invention, expression of the protein that produces the detectable 
metabolite is dependent on binding of the test compound to the melanocortin receptor 
expressed by each cell in the panel. In preferred embodiments, the detectable metabolite 
absorbs light to produce a colored product. Thus, this embodiment of the invention 
provides a panel of melanocortin receptor-expressing cells whereby melanocortin 

25 hormone binding results in the production of a colored product in proportion to the 

extent of cAMP production in the cell as a result of hormone receptor binding. 

The invention also provides melanocortin receptor agonists identified by the 
methods and using the screening panel of the invention. In preferred embodiments, the 
agonist is an agonist of the MC-3 mammalian melanocortin receptor. In other preferred 

30 embodiments, the agonist is an agonist of the MC-4 mammalian melanocortin receptor. 

The invention provides melanocortin receptor antagonists identified by the 
methods and using the screening panel of the invention. In preferred embodiments, the 
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antagonist is an antagonist of the MC-3 mammalian melanocortin receptor. In other 
preferred embodiments, the antagonist is an antagonist of the MC-4 mammalian 
melanocortin receptor. 

The invention also provides methods for characterizing mammalian melanocortin 
receptor agonists for the capacity to modify or influence metabolism and feeding 
behavior in an animal. In a first aspect, the invention provides a method for 
characterizing melanocortin receptor MC-3 or MC-4 agonists as inhibitors of feeding 
behavior in an animal, the method comprising the steps of providing food to an animal 
that has been deprived of food for at least 12 hours, with or without administering to the 
animal an MC-3 or MC-4 receptor agonist of the invention, and comparing the amount 
of food eaten by the animal after administration of the MC-3 or MC-4 receptor agonist 
with the amount of food eaten by the animal without administration of the MC-3 or MC- 
4 receptor agonist. 

In another aspect, the invention provides a method for characterizing a 
melanocortin MC-3 or MC-4 receptor antagonist as a stimulator of feeding behavior in 
an animal. In this embodiment, the method comprises the steps of providing food to an 
animal not deprived of food for at least 12 hours, with or without administering to the 
animal an MC-3 or MC-4 receptor antagonist, immediately prior to the onset of darkness 
or nighttime, and comparing the amount of food eaten by the animal after administration 
of the MC-3 or MC-4 receptor antagonist with the amount of food eaten by the animal 
without administration of the MC-3 or MC-4 receptor antagonist. 

Thus, the invention also provides methods for using certain of the melanocortin 
receptor agonists and antagonists for modifying feeding behavior in an animal. In a first 
aspect, the invention provides a method for stimulating feeding in an animal, the method 
comprising administering to the animal an MC-3 or MC-4 receptor antagonist. In a 
preferred embodiment, the antagonists are administered systemically. In additional 
embodiments, the antagonists are administered intracerebroventricularly. 

In another aspect, the invention provides a method for inhibiting feeding in an 
animal, the method comprising administering to the animal an MC-3 or MC-4 receptor 
agonist. In a preferred embodiment, the agonists are administered systemically. In 
additional embodiments, the agonists are administered intracerebroventricularly. 
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In yet another aspect, the invention provides mammalian melanocortin receptor 
agonists having the general formula: 

A-B-C-D-E-F-G-amide 
wherein A is an aliphatic amino acid residue, including for example Leu, He, Nle and 
5 Met, as well as analogues and substituted derivatives thereof; B is an acidic amino acid 

residue, including for example Asp and Glu; C is a basic amino acid residue, such as 
His; D is an aromatic amino acid residue having a D- conformation, including D-Phe, D- 
Tyr and substituted derivatives thereof, E is a basic amino acid residue, for example Arg, 
Lys, homoArg, homoLys, and analogues or substituted derivatives thereof; F is Trp or 
10 substituted derivatives thereof; and G is Lys, homoLys or a substituted derivative 

thereof In the peptide embodiments of the melanocortin receptor agonists of the 
invention, the peptide is cyclized by the formation of an amide bond between the side 
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side 
chain amino group of the Lys or homoLys residue at position G. In preferred 
15 embodiments, the melanocortin receptor agonists of the invention are agonists of the 

MC-3 or MC-4 receptor. 

The invention also provides mammalian melanocortin receptor antagonists 
having the general formula: 

A-B-C-D-E-F-G-amide 

20 wherein A is an aliphatic amino acid residue, including for example Leu, He, Nle and 
Met, as well as analogues and substituted derivatives thereof; B is an acidic amino acid 
residue, including for example Asp and Glu; C is a basic amino acid residue, such as 
His; D is an aromatic amino acid residue having a D- conformation, including D-Nal and 
substituted derivatives thereof; E is a basic amino acid residue, for example Arg, Lys, 

25 homoArg, homoLys, and analogues or substituted derivatives thereof; F is Trp or 
substituted derivatives thereof; and G is Lys, homoLys or a substituted derivative 
thereof. In the peptide embodiments of the melanocortin receptor antagonists of the 
invention, the peptide is cyclized by the formation of an amide bond between the side 
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side 

30 chain amino group of the Lys or homoLys residue at position G. In preferred 
embodiments, the melanocortin receptor antagonists of the invention are agonists of the 
MC-3 or MC-4 receptor. 
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It is an advantage of the present invention that it provides an in vitro screening 
method for characterizing compounds having melanocortin receptor activities that relate 
to feeding behavior in animals. Specifically, the invention advantageously provides 
means and methods for identifying compounds having melanocortin receptor agonist 
and/or antagonist activity that have been associated with either stimulating or inhibiting 
feeding behavior when administered to an animal. The invention thus provides an 
economical first step in screening compounds for the capacity to affect feeding behavior, 
including synthetic, peptidomimetic or organomimetic derivatives of melanocortin 
receptor agonists or antagonists as disclosed herein or elsewhere. 

Specific preferred embodiments of the present invention will become evident 
from the following more detailed description of certain preferred embodiments and the 
claims. 

DESCRIPTION OF THE DRAWINGS 

Figures 1 A and IB illustrate the nucleotide (SEQ ID No.: 3) and amino acid 
(SEQ ID No.: 4) sequence of the mouse melanocyte stimulating hormone receptor gene. 

Figures 2A and 2B illustrate the nucleotide (SEQ ID No.: 5) and amino acid 
(SEQ ID No.: 6) sequence of the human melanocyte stimulating hormone receptor gene. 

Figures 3 A and 3B illustrate the nucleotide (SEQ ID No.: 7) and amino acid 
(SEQ ID No.: 8) sequence of the human adrenocorticotropic stimulating hormone 
receptor gene. 

Figures 4A and 4B illustrate the nucleotide (SEQ ID No.: 9) and amino acid 
(SEQ ID No.: 10) sequence of the bovine adrenocorticotropic stimulating hormone 
receptor gene. 

Figures 5 A and 5B illustrate the nucleotide (SEQ ID No.: 11) and amino acid 
(SEQ ID No.: 12) sequence of the rat melanocortin-3 receptor gene. 

Figures 6 A and 6B illustrate the nucleotide (SEQ ID No.: 15) and amino acid 
(SEQ ID No.: 16) sequence of the human melanocortin-4 receptor gene. 

Figures 7A and 7B illustrate the nucleotide (SEQ ID No.: 17) and amino acid 
(SEQ ID No.: 1 8) sequence of the mouse melanocortin-5 receptor gene. 
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Figure 8 shows a graph of intracellular cAMP accumulation resulting from 
melanocyte stimulating hormone receptor agonist binding in human 293 cells transfected 
with a MSH receptor-encoding recombinant expression construct, wherein -□- represents 
binding of NDP-MSH, «o- represents binding of ACTH and -A- represents binding of 
5 aMSH. 

Figure 9 illustrates the cAMP response of mouse Yl cells to binding of 
melanocortin peptides to human melanocortin-2 (ACTH) receptor, as measured by the 
P-galactosidase assay described in Example 4, wherein represents binding to wild- 
type ACTH-R and -A- represents binding to an ACTH-R variant. 

10 Figure 10 illustrates the results of competition binding experiments of 

melanocortin peptides to cells expressing a recombinant expression construct encoding 
the rat melanocortin-3 receptor, wherein -■- represents binding of NDP-MSH, -A- 
represents binding of yMSH, -•- represents binding of aMSH, -o- represents binding 
of ACTH4_,o and represents binding of ORG2766. 

15 Figures 11A through 11C illustrate the results of experiments showing 

intracellular cAMP accumulation caused by receptor-ligand binding in human 293 cells 
expressing the MC-3 receptor. In Figure 1 1 A, -•- represents binding of aMSH, -■- 
represents binding of Y2-MSH, -A- represents binding of des-acetyl aMSH and -□- 
represents binding of ACTH,_ 39 . In Figure 1 IB, -•- represents binding of Yr M SH, -■- 

20 represents binding of y 2 -MSH and -A- represents binding of des-acetyl ^ -MSH. In 
Figure 1 1C, -•- represents binding of ACTH4_ I0 , -■- represents binding of NDP-MSH 
and -A- represents binding of ORG2766. 

Figure 12 shows a graph of intracellular cAMP accumulation resulting from 
peptide binding to human melanocortin-4 receptor agonist in human 293 cells 

25 transfected with a MC-4 receptor-encoding recombinant expression construct, wherein 

-□- represents binding of ACTH^o, represents binding of ACTH,. 39 , -■- represents 
binding of NDP-MSH, -o- represents binding of aMSH, -A- represents binding of y 2 - 
MSH, and -A- represents binding of des-acetyl aMSH. 

Figure 13 illustrates the results of cAMP accumulation and cAMP-dependent p- 

30 galactosidase assays of melanocortin peptide binding to a rat melanocortin-5 receptor, 

wherein -□- represents binding of aMSH, -A- represents binding of p-MSH, and -o- 
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represents binding of y-MSH, each determined using the p-gal method, and wherein -■- 
represents binding of aMSH, -A- represents binding of P-MSH, and represents 
binding of y-MSH, each determined using the cAMP method. 

Figure 14 illustrates the structure of the pCRE/ P-gal plasrnid. 
Figure 15 illustrates the results of the P-galactosidase-coupled, colorimetric 
melanocortin receptor binding assay using cells expressing each of the MC- 1, MC-3, 
MC4 or MC-5 receptors and contacted with aMSH or a variety of aMSH analogues, 
wherein «■- represents binding of aMSH, -A- represents binding of NDP-MSH, 
represents binding of SHU9128 (para-Fl substituted), represents binding of 
SHU9203 (p-Cl substituted), -A- represents binding of SHU8914 Q>I substituted), and 
-o- represents binding of SHU9119. 

Figures 16A through 16 D show the results of the P-galactosidase-coupled, 
colorimetric melanocortin receptor binding assay to determine antagonist activity of 
melanocortin analogues SHU9119 and SHU8914 in cells expressing each of the 
melanocortin receptors MC-3 and MC-4. In Figure 16A, -■- represents binding of 
aMSH, represents binding of lOOnM SHU91 19, -A- represents binding of lOnM 
SHU91 1 9, and -o- represents binding of InM SHU91 1 9. In Figure 1 6B, -■- represents 
binding of aMSH, represents binding of lOOnM SHU91 19, -A- represents binding 
of 50nM SHU9119, and -o- represents binding of lOnM SHU91 1 9. In Figure 16C, -■- 
represents binding of aMSH, -□- represents binding of lOOOnM SHU8914, -A- 
represents binding of lOOnM SHU8914, and -o- represents binding of lOnM SHU8614. 
In Figure 16D, -■- represents binding of aMSH, -□- represents binding of lOOnM 
SHU8914, -A- represents binding of 50nM SHU8914, and -o- represents binding of 
lOnM SHU8614. 

Figure 17 shows the results of classic competition binding assays using the 
melanocortin analogues SHU91 19 and SHU8914 at the MC3-R and MC-4 R receptors, 
wherein -■- represents binding of NDP-MSH, -A- represents binding of SHU8914 (p-1 
substituted), and -o- represents binding of SHU91 19. 

Figures 18A and 18B shows the results of cAMP accumulation experiments 
(performed using the P-galactosidase assay of Example 4) for rat MC-3 receptor (Figure 
18 A) and for mouse MC-4 receptor (Figure 18B). In Figure 18 A, -■- represents 
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binding of NDP-MSH, -A- represents binding of MTII and represents binding of 
forskolin. In Figure 18B, represents binding of MTII, -A- represents binding of 
NDP-MSIi and -▼- represents binding of forskolin. 

Figures 19A through 19C show the effect on food intake of 
5 intracerebroventricular administration of melanocortin analogue SHU91 19 in mice. In 

Figure 19 A, -■- represents administration of acsf (n=7) and represents 
administration of 6nmol of SHU9119 (n=6). In Figure 19B, -■- represents 
administration of acsf (n=6) and -□- represents administration of 6nmol of SHU91 19 
(n=6). In Figure 19C, -o- represents administration of acsf (n=l 1) and represents 
10 administration of 6nmol of SHU91 19 (n=12). 

Figures 20A through 20C show the effect on food intake of 
intracerebroventricular administration of melanocortin analogue MTII in mice. In 
Figure 20A, -•- represents administration of acsf (n=8), -▼- represents administration 
of O.lnmol MTII (n=8), -■- represents administration of Inmol MTII (n=7) and -A- 
15 represents administration of 3nmol MTII (n=9). In Figure 20B, -•- represents 

administration of acsf (n=12), -□- represents administration of 3nmol MTII and 6nmol 
SHU91 19 (n=9) and -A- represents administration of 3nmol MTII (n=9). 

Figure 20D shows the effect on locomotor activity of intracerebroventricular 
administration of melanocortin analogue MTII in mice, wherein -■- represents 
20 administration of vehicle alone (n=6) and -A- represents administration of 3nmol MTII 
(n=6). 

Figures 21 A through 2 ID show the effect on food intake of 
intracerebroventricular administration of melanocortin analogue MTII in mice. In 
Figure 21 A, represents administration of acsf (n=6) and -A- represents 
25 administration of 3nmol MTU (n=7). In Figure 21B, open bars represent administration 

of acsf (n=6), solid bars represents administration of 1.18nmol neuropeptide Y (NPY; 
n=6) and stipled bars represents administration of 3nmol MTII and 1.18nmol NPY 
(n=6). In Figure 21C, -•- represents administration of acsf (n=7) and -A- represents 
administration of 3nmol MTII (n=7). In Figure 21D, -■- represents administration of 
. 30 lOOnmol MTII (n=6) and - A- represents administration of vehicle alone (n=6). 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The term "melanocortin receptor" as used herein reference to proteins having the 
biological activity of any of the disclosed melanocortin receptors, including the MC-1 
(SEQ ID Nos.: 3, 4, 5 and 6), MC-2 (ACTH; SEQ ID Nos.: 7, 8, 9 and 10), MC-3 (SEQ 
ID Nos.: 1 1 and 12), MC-4 (SEQ ID Nos.: 15 and 16) or MC-5 (SEQ ID Nos.: 1 7 and 
1 8) receptors, as well as naturally-occurring and genetically-engineered allelic variations 
in these sequences. 

Cloned nucleic acid provided by the present invention may encode MC receptor 
protein of any species of origin, including, for example, mouse, rat, rabbit, cat, and 
human, but preferably the nucleic acid provided by the invention encodes MC receptors 
of mammalian, most preferably rodent and human, origin. 

The production of proteins such as the MC receptors from cloned genes by 
genetic engineering means is well known in this art. The discussion which follows is 
accordingly intended as an overview of this field, and is not intended to reflect the full 
state of the art. 

DNA which encodes MC receptors may be obtained, in view of the instant 
disclosure, by chemical synthesis, by screening reverse transcripts of mRNA from 
appropriate cells or cell line cultures, by screening genomic libraries from appropriate 
cells, or by combinations of these procedures, as illustrated below. Screening of mRNA 
or genomic DNA may be carried out with oligonucleotide probes generated from the MC 
receptor gene sequence information provided herein. Probes may be labeled with a 
detectable group such as a fluorescent group, a radioactive atom or a chemiluminescent 
group in accordance with know procedures and used in conventional hybridization 
assays, as described in greater detail in the Examples below. In the alternative, MC 
receptor gene sequences may be obtained by use of the polymerase chain reaction (PCR) 
procedure, with the PCR oligonucleotide primers being produced from the MC receptor 
gene sequences provided herein. See U.S. Patent Nos. 4,683,195 to Mullis et al. and 
4,683,202 to Mullis. 

MC receptor proteins may be synthesized in host cells transformed with a 
recombinant expression construct comprising a nucleic acid encoding each of the 
receptors disclosed herein. Such a recombinant expression construct can also be 
comprised of a vector that is a replicable DNA construct. Vectors are used herein either 
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to amplify DNA encoding an MC receptor and/or to express DN A which encodes an MC 
receptor. For the purposes of this invention, a recombinant expression construct is a 
replicablei DNA construct in which a DNA sequence encoding an MC receptor is 
operably linked to suitable control sequences capable of effecting the expression of the 
5 receptor in a suitable host cell. The need for such control sequences will vary depending 

upon the host selected and the transformation method chosen. Generally, control 
sequences include a transcriptional promoter, an optional operator sequence to control 
transcription, a sequence encoding suitable mRNA ribosomal binding sites, and 
sequences which control the termination of transcription and translation. Amplification 

10 vectors do not require expression control domains. All that is needed is the ability to 

replicate in a host, usually conferred by an origin of replication, and a selection gene to 
facilitate recognition of transformants. See, Sambrook et ah, 1990, Molecular Cloning : 
A Laboratory Manual (Cold Spring Harbor Press: New York). 

Also specifically provided by the invention are reporter expression constructs 

15 comprising a nucleic acid encoding a protein capable of expressing a detectable 

phenotype, such as the production of a detectable reporter molecule, in a cell expressing 
the construct Such constructs can be used for producing recombinant mammalian cell 
lines in which the reporter construct is stably expressed. Most preferably, however, the 
reporter construct is provided and used to induce transient expression over an 

20 experimental period of from about 18 to 96 hrs in which detection of the reporter 

protein-produced detectable metabolite comprises an assay. Such reporter expression 
constructs are also provided wherein induction of expression of the reporter construct 
is controlled by a responsive element operatively linked to the coding sequence of the 
reporter protein, so that expression is induced only upon proper stimulation of the 

25 responsive element Exemplary of such a responsive element is a cAMP responsive 

element (CRE), which induces expression of the reporter protein as a result of an 
increase in intracellular cAMP concentration. In the context of the present invention, 
such a stimulus is associated with melanocortin receptor binding, so that a reporter 
construct comprising one or more CREs is induced to express the reporter protein upon 

30 binding of a receptor agonist to a MC receptor in a recombinantly transformed 

mammalian cell. Production and use of such a reporter construct is illustrated below in 
Example 5. 
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Vectors usefulfor practicing the present invention include plasmids, viruses 
(including phage), retroviruses, and integratable DNA fragments (i.e., fragments 
integratable into the host genome by homologous recombination). The vector replicates 
and functions independently of the host genome, or may, in some instances, integrate 
into the genome itself Suitable vectors will contain replicon and control sequences 
which are derived from species compatible with the intended expression host. A 
preferred vector is the plasmid pcDNA/neo I. Transformed host cells are cells which 
have been transformed or transfected with recombinant expression constructs made 
using recombinant DNA techniques and comprising mammalian MC receptor-encoding 
sequences. Preferred host cells are human 293 cells. Preferred host cells for the MC-2 
(ACTH) receptor are Yl cells (subclone OS3 or Y6). Transformed host cells are chosen 
that ordinarily express functional MC receptor protein introduced using the recombinant 
expression construct. When expressed, the mammalian MC receptor protein will 
typically be located in the host cell membrane. See, Sambrook et aL, ibid. 

Cultures of cells derived from multicellular organisms are a desirable host for 
recombinant MC receptor protein synthesis. In principal, any higher eukaryotic cell 
culture is workable, whether from vertebrate or invertebrate culture. However, 
mammalian cells are preferred, as illustrated in the Examples. Propagation of such cells 
in cell culture has become a routine procedure. See Tissue Culture . Academic Press, 
Kruse & Patterson, editors (1973). Examples of useful host cell lines are human 293 
cells, VERO and HeLa cells, Chinese hamster ovary (CHO) cell lines, mouse Yl 
(subclone OS3), and WI1 38, BHK, COS-7, CV, and MDCK cell lines. Human 293 
cells are preferred. 

Cells expressing mammalian MC receptor proteins made from cloned genes in 
accordance with the present invention may be used for screening agonist and antagonist 
compounds for MC receptor activity. Competitive binding assays are well known in the 
art and are described in the Examples below. Such assays are useful for drug screening 
of MC receptor agonist and antagonist compounds, as detected in receptor binding 
assays as described below. 

One particular use of such screening assays are for developing drugs and other 
compounds useful in modifying or changing feeding behavior in mammals. The 
invention provides an assay system, comprising a panel of recombinant mammalian 
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cells, heterologously expressing each of the MC receptors disclosed herein, wherein the 
panel is constructed of at least one cell line expressing an MC receptor, and most 
preferably comprising cells expressing each of the MC receptors. The invention 
provides such panels also comprising a detection means for detecting receptor agonist 
or antagonist binding, such as the reporter expression constructs described herein, using 
direct binding and competition binding assays as described in the Examples below. In 
the use of this panel, each MC receptor is assayed for agonist or antagonist patterns of 
binding a test compound, and a characteristic pattern of binding for all MC receptors is 
thereby determined for each test compound. This pattern is then compared with known 
MC receptor agonists and antagonists to identify new compounds having a pattern of 
receptor binding activity associated with a particular behavioral or physiological effect. 

For example, provided herein is experimental evidence that MC-3 or MC-4 
receptor antagonists are capable of stimulating feeding in hungry animals, and that MC-3 
or MC-4 agonists are capable of inhibiting feeding in animals otherwise stimulated to 
eat. The invention provides an in vitro assay to characterize MC-3 and MC-4 
agonists/antagonists as a preliminary and economical step towards developing feeding 
behavior-modulating drugs for use in vivo. 

These results on feeding behavior in vivo have been obtained with certain MC 
receptor binding analogues, SHU91 19 and MTII. These compounds have the following 
chemical structure: 
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Generally, those skilled in the art will recognize that peptides as described 
herein may be modified by a variety of chemical techniques to produce compounds 
having essentially the same activity as the unmodified peptide, and optionally having 
other desirable properties. For example, carboxylic acid groups of the peptide, 
whether carboxyl-terminal or sidechain, may be provided in the form of a salt of a 
pharmaceutically-acceptable cation or esterified to form a C r C, 6 ester, or converted 
to an amide of formula NR,R 2 wherein R, and R 2 are each independently H or C r C 16 
alkyl, or combined to form a heterocyclic ring, such as 5- or 6-membered. Amino 
groups of the peptide, whether amino-terminal or sidechain, may be in the form of a 
pharmaceutically-acceptable acid addition salt, such as the HC1, HBr, acetic, benzoic, 
toluene sulfonic, maleic, tartaric and other organic salts, or may be modified to C r C l6 
alkyl or dialkyl amino or further converted to an amide. Hydroxy! groups of the 
peptide sidechain may be converted to C,-C l6 alkoxy or to a C,-C l6 ester using well- 
recognized techniques. Phenyl and phenolic rings of the peptide sidechain may be 
substituted with one or more halogen atoms, such as fluorine, chlorine, bromine or 
iodine, or with C r C l6 alkyl, C,-C 16 alkoxy, carboxylic acids and esters thereof, or 
amides of such carboxylic acids. Methylene groups of the peptide sidechains can be 
extended to homologous C 2 -C 4 alkylenes. Thiols can be protected with any one of a 
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number of well-recognized protecting groups, such as acetamide groups. Those skilled 
in the art will also recognize methods for introducing cyclic structures into the peptides 
of this invention to select and provide conformational constraints to the structure that 
result in enhanced binding and/or stability. For example, a carboxyl-terminal or 
5 amino-terminal cysteine residue can be added to the peptide, so that when oxidized the 

peptide will contain a disulfide bond, thereby generating a cyclic peptide. Other 
peptide cyclizing methods include the formation of thioethers and carboxyl- and 
amino-terminal amides and esters. 

Peptidomimetic and organomimetic embodiments are also hereby explicitly 

10 declared to be within the scope of the present invention, whereby the three- 

dimensional arrangement of the chemical constituents of such peptido- and 
organomimetics mimic the three-dimensional arrangement of the peptide backbone and 
component amino acid sidechains in the peptide, resulting in such peptido- and 
organomimetics of the peptides of this invention having substantial biological activity. 

15 It is implied that a pharmacophore exists for the receptor agonist and antagonist 

properties of these and related MC receptor binding analogues. A pharmacophore is 
an idealized, three-dimensional definition of the structural requirements for biological 
activity. Peptido- and organomimetics can be designed to fit each pharmacophore with 
current computer modeling software (computer aided drug design). MC receptor 

20 binding analogues derived using such software and comprising peptido- and 

organomimetics of SHU9119 and MTII and related analogues are within the scope of 
the claimed invention. 

The MC receptor binding analogues, in particular those analogues that are MC-3 
or MC-4 receptor agonists or antagonists are provided to be used in methods of 

25 influencing, modifying or changing feeding behavior in mammals in vivo. Specific 

examples of uses for the MC receptor binding analogues of the invention include but are 
not limited to treatment of eating disorders such as anorexia and obesity, and other 
pathological weight and eating-related disorders. Other examples are failure to thrive 
disorders and disease-related cachexia, such as occurs in cancer patients. Also within 

30 the scope of the analogues of the invention is use for enhancing appearance, athletic 
ability, or adjuvant to other therapies to treat disorders such as high blood pressure, high 
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serum cholesterol, vascular and heart disease, stroke, kidney disease, diabetes and other 
metabolic disorders. 

The Examples which follow are illustrative of specific embodiments of the 
invention, and various uses thereof They set forth for explanatory purposes only, and 
are not to be taken as limiting the invention. 

EXAMPLE 1 
Isolation of an aMSH Receptor Probe by Random 
PCR Amplification of Human Melanoma cDNA Using 
Degenerate Oligonucleotide ErilMES 

In order to clone novel G-protein coupled receptors, cDNA prepared from RNA 

from human melanoma cells was used as template for a polymerase chain reaction 

(PCR)-based random cloning experiment. PCR was performed using a pair of 

degenerate oligonucleotide primers corresponding to the putative third and sixth 

transmembrane regions of G-protein coupled receptors (Libert et a/., 1989, Science 244 : 

569-72; Zhou et ai, 1990, Nature MZ: 76-80). The PCR products obtained in this 

experiment were characterized by nucleotide sequencing. Two novel sequences 

representing novel G-protein-coupled receptors were identified. 

PCR amplification was performed as follows. Total RNA was isolated from a 

human melanoma tumor sample by the guanidinium thiocyanate method (Chirgwin et 

a/., 1979, Biochemistry 1£: 5294-5299). Double-stranded cDNA was synthesized from 

total RNA with murine reverse transcriptase (BRL, Gaithersburg, MD) by oligo-dT 

priming (Sambrook et al, ibid.). The melanoma cDNA mixture was then subjected to 

45 cycles of PCR amplification using 500 picomoles of degenerate oligonucleotide 

primers having the following sequence: 

Primer III (sense): 

GAGTCGACCTGTG(CVT)G(C/^ 

(SEQIDNO:l) 

and 

Primer VI (antisense): 

CAGAATTCAG(T/A)AGGGCAICCAGCAGAI(G/C)(G/A)(T/C)GAA 

(SEQIDNO:2) 
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in 100 nl of a solution containing 50 mM Tris-HCI (pH 8.3), 2.5 mM MgCl 2 , 0.01% 
gelatin, 200 \iM each dNTP, and 2.5 units of Tag polymerase (Saiki et al. % 1988, 
Science 222: 487-491). These primers were commercially synthesized by Research 
Genetics Inc. (Huntsville, AL). Each PCR amplification cycle consisted of incubations 
at 94°C for 1 min (denaturation), 45 C for 2 min (annealing), and°72 C for 2 min 
(extension). 

Amplified products of the PCR reaction were extracted with phenol/chloroform 
and precipitated with ethanol. After digestion with EcoRI and Sail, the PCR products 
were separated on a 1.2% agarose gel. A slice of this gel, corresponding to PCR 
products of 300 basepairs (bp) in size, was cut out and purified using glass beads and 
sodium iodide, and the insert was then cloned into a pBKS cloning vector (Stratagene, 
LaJolla, CA). 

A total of 172 of such pBKS clones containing inserts were sequenced using 
Sequenasc (U.S. Biochemical Corp., Cleveland, OH) by the dideoxynucleotide chain 
termination method (Sanger et al. 9 1977, Proc. Natl. Acad. Sci. USA 24: 5463-5467). 
Two types of sequences homologous to other G-protein coupled receptors were 
identified. 

EXAMPLE 2A 
Isolation of a Mouse aMSH (MC-l) Receptor cDNA 

Probes isolated in Example 1 was used to screen a Cloudman melanoma cDNA 
library in order to isolate a full-length cDNA corresponding to the cloned probe. One 
clone was isolated from a library of 5 x 1 0* clones screened as described below in 
Example 2B. This clone contained an insert of 2.6 kilobases (kb). The nucleotide 
sequence of the complete coding region was determined (see co-owned U.S. Patent No. 
5,532,347, incorporated by reference); a portion of this cDNA comprising the coding 
region was sequenced and is shown in Figures 1 A and IB (SEQ ID Nos: 3 & 4). 

EXAMPLE 2B 
Isolation of a Hnman aMSH (MC-1) Receptor cDNA 
In order to isolate a human counterpart of the murine melanocyte aMSH 
receptor gene disclosed in Example 2A and co-owned U.S. Patent No. 5,532,347, a 
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human genomic library was screened at high stringency (50% formamide, 42 °C) using 
the human PCR fragments isolated as described in Example 1 . A genomic clone was 
determined to encode an human MSH receptor (SEQ ID NO:5). The human MSH 
receptor has a predicted amino acid sequence (SEQ ID NO:6) that is 75% identical and 
colinear with the mouse ocMSH receptor cDNA sequence (Figures 2A and 2B, 
represented as human MSH-R). The predicted molecular weight of the human MSH R 
is 34.7kD. 

EXAMPLE 2C 
Isolation of a Hmnan ACTH (MC-21 Receptor cDNA 
For cloning the ACTH receptor (MC-2), a human genomic library was 
screened at high stringency (50% formamide, 1M NaCl, 50nM Tris-HCl, pH 7.5, 
0.1 % sodium pyrophosphate, 0.2% sodium dodecyl sulfate, 100/Ug/ml salmon sperm 
DNA, 10X Denhardt's solution, 42°C), using the human PCR fragments isolated as 
described in Example 1 herein and U.S. Patent No. 5,280,112, incorporated by 
reference. A genomic clone was isolated that encodes a highly related G-coupled 
receptor protein (SEQ ID NO:7 and Figures 3 A and 3B). The predicted amino acid 
sequence (SEQ ID NO: 8) of this clone is 39% identical and also colinear, excluding 
the third intracellular loop and carboxy-tcrminal tail, with the human MSH receptor 
gene product. The predicted molecular weight of this putative ACTH R is 33.9 
kilodaltons (kD). This clone was identified as encoding an MC-2 receptor based on 
its high degree of homology to the murine and human MSH receptors, and the pattern 
of expression in different tissue types, as described in Example 3 in U.S. Patent 
5,280,112. 

EXAMPLE 2D 

Nation of a Poviine ACTH fMC-2) Rapt or rt > N A 
A bovine genomic DNA clone encoding the bovine counterpart of the MC-2 
(ACTH) receptor was isolated from a bovine genomic library, essentially as described 
in Example 2C above, and its nucleotide sequence determined (as shown in Figures 
4A and 4B; SEQ ID Nos:9 & 10). 
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EXAMPLE 2E 
Isolation of a Rat y-MSH (MC-3) Receptor cDNA 

The mouse aMSH receptor cDNA isolated as described in Example 2A and co- 
owned U.S. Patent No. 5,532,347 was used to screen a rat hypothalamus cDNA library 
5 at low stringency (30% fonnamide, 5X SSC, 0. 1 % sodium pyrophosphate, 0.2% sodium 

dodecyl sulfate, 100}ig/ml salmon sperm DNA, and 10% Denhardt's solution) at 42°C 
for 1 8h. A 1 kb cDNA clone was isolated and sequenced as described in co-owned U.S. 
Patent No. 5,532,347, and this clone used to re-screen the rat hypothalamus cDNA 
library at high stringency (same conditions as above except that formamide was present 
10 at 45%). A cDNA clone approximately 2.0 kb in length was isolated and analyzed as 
described in co-pending U.S. Application Serial No. 08/044,812, incorporated by 
reference; a portion of this cDNA comprising the coding region was sequenced and is 
shown in Figures 5 A and 5B (SEQ ID Nos:l 1 & 12). 

15 EXAMPLE 2F 

Isolation of a Human MC-4 Receptor DNA 

For cloning the MC-4 receptor, a human genomic library was screened at 

moderate stringency (40% formamide, 1M NaCl, 50mM Tris-HCl, pH 7.5, 0.1% 

sodium pyrophosphate, 0.2% sodium dodecyl sulfate, 100Mg/ml salmon sperm DNA, 

20 10X Denhardt's solution, 42°C), using rat PCR fragments isolated as described in 

Example 1 herein, with the exception that the following primers were used for PCR: 

Primer II (sense): 

GAGTCGACC(A/G)CCCATGTA(C/T)T(AGT)(C/T)TTCATCTG 

(SEQ ID NO: 13) 

25 

and 

Primer VII (antisense): 

30 CAGAATTCGGAA(A/G)GC(A/G)TA(G/T)ATGA(A/G)GGGGTC 

(SEQIDNO:14) 

A genomic clone was isolated that encodes a highly related G-coupled receptor 
protein (SEQ ID NO: 15 and Figures 6A and 6B) on a 1.9kb HindUl fragment. The 
35 predicted amino acid sequence (SEQ ID NO: 16) of this clone is 55-61 % sequence 
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identity with human MC-3 and MC-5 receptors, and 46-47% sequence identity with 
the human MC-1 and MC-2 (ACTH) receptor. 

EXAMPLE 2G 
Nation of ? Mows? MC-S Receptor PISA 
One million clones from a mouse 1 29S VJ genomic library comprising 5,000,000 
clones in the AFixII vector (Stratagene) was screened at low stringency (hybridization 
in 40% formamide at 42°C, washing performed in 0.5X SSC at 60°C, as described 
above in Example 2E) using radiolabeled probed from the rat MC-3 and MC-4 receptors 
(as described in Examples 2E and 2F). Positively-hybridizing clones were isolated and 
sequenced, and the sequences obtained were compared to previously-isolated 
melanocortin receptor clones. One clone, comprising a previously-unknown sequence, 
was determined to encode the MC-5 melanocortin receptor. The nucleotide and amino 
acid sequences of this receptor are shown in Figures 7A and 7B (SEQ ID No.: 1 7 & 1 8). 

EXAMPLE 3 

Construction of a Recombinant Expression Construct, DNA Transfection 
and Functional Expression of the MCR Gene Products 

In order to produce recombinant mammalian cells expressing each of the 
melanocortin receptors of Example 2, cDNA from each receptor was cloned into a 
mammalian expression construct, the resulting recombinant expression construct 
transfected into human 293 cells, and cell lines generated that expressed the 
melanocortin receptor proteins in cellular membranes at the cell surface. 

The mouse aMSH receptor was cloned by excising the entire coding region of 
the aMSH R (MC-1) cDNA insert comprising a 2.1kb fragment and subcloning this 
fragment into the BamUVXhol sites of pcDNAI/neo expression vector (Invitrogen, San 
Diego, CA). The resulting plasmid was prepared in large-scale through one cycle of 
CsCl gradient ultracentrifugation, and 20 ^g of the plasmid transfected into each 
100mm dish of 293 cells using the calcium phosphate method (see Chen & Okayama, 
1987, /. 2: 2745-2752). After transfection, cells were cultured in DMEM media 
supplemented with 10% calf serum in a 3% C0 2 atmosphere at 37 °C. Selection was 
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performed with neomycin (G418; GIBCO) at a concentration of 1000 /ig/ml; selection 
was started 72 hr after transfection and continued for 3 weeks. 

The ccMSH R is known to couple to G-proteins and thereby activate adenyl 
cyclase, increasing intracellular levels of cAMP (see Buckley & Ramachandran, 1981, 
5 Proc. Natl. Acad. Sci. USA Zg: 7431-7435; Grahame-Smith et al, 1967, J. Biol. 
Chem. 242: 5535-5541; Mertz & Catt, 1991, Proc. Natl. Acad. Sci. USA ££: 8525- 
8529; Pawalek et al. f 1976, Invest. Dermatol. 66: 200-209). This property of cells 
expressing the aMSH receptor was used analyze expression of the aMSH receptor in 
cell colonies transfected with the expression vectors described herein as follows. Cells 

10 (~lxl0 6 ) were plated in 6-well dishes, washed once with DMEM containing 1% 
bovine serum albumin (BSA) and 0.5mM IBMX (a phosphodiesterase inhibitor), then 
incubated for 45 minutes at 37 °C with varying concentrations of the melanotropic 
peptides aMSH, PMSH, yMSH, the MSH peptide analogues Nle 4 , D-Phe 7 -aMSH 
(NDP-MSH), and ACTH. Following hormone treatment, the cells were washed twice 

15 with phosphate buffered saline and intracellular cAMP extracted by lysing the cells 
with 1ml of 60% ethanol. Intracellular cAMP concentrations were determined using 
an assay (Amersham) which measures the ability of cAMP to displace [8- 3 H] cAMP 
from a high affinity cAMP binding protein (see Gilman, 1970, Proc. Natl. Acad. Sci. 
USA £Z: 305-312). 

20 The results of these experiments are shown in Figure 8. The abscissa indicates 

the concentration of each hormone and the ordinate indicates the percentage of basal 
intracellular cAMP concentration achieved by each treatment. Points indicate the 
mean of duplicate incubations; the standard error did not exceed 15% for any data 
point. None of the peptides tested induced any change in intracellular cAMP in cells 

25 containing the vector alone. Cells expressing the murine aMSH receptor responded 
to melanotropic peptides with a 2-3 fold elevation of intracellular cAMP, similar to 
levels of cAMP induced by these peptides in the Cloudman cell line (see Pawalek, 
1985, Yale J. Biol. Med. S£: 571-578). The EC 50 values determined for aMSH 
(2.0x10^), ACTH (8.0xltf M) and the superpotent MSH analogue NDP-MSH 

30 (2.8xlO* n M) correspond closely to reported values (see Tatro et al. % 1990, Cancer 
Res. SQ: 1237-1242). As expected, the PMSH peptide had an EC^ value comparable 
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to aMSH, while yMSHliad little or no activity (see Slominski et aL, 1992, Life Sci. 
5Q: 1103-1108), confirming the identity of this receptor as a melanocyte ctMSH 
receptor. 

A similar series of experiments were performed using mouse Yl cells 
(subclone OS3; Schimmer et al. 9 1995, J. Cell. Physiol. 164-171) expressing the 
human and bovine MC-2 (ACTH) receptor clones of Examples 2C and 2D. These 
results are shown in Figure 9, where the extent of cAMP responsive element-linked 
p-galactosidase activity (see Example 4, below) is shown with increasing 
concentrations of ACTH. 

The entire coding region of the MC-3 receptor cDNA insert, obtained as 
described above in the co-pending U.S. Serial No. 08/044,812, was contained in a 2.0kb 
restriction enzyme digestion fragment and was cloned into the BamKUXhol sites of 
pcDNA/neo I expression vector (Invitrogen, San Diego, CA). The resulting plasmid was 
prepared in large-scale through one cycle of CsCl gradient ultracentrifugation and 20 jig 
pcDNA/MC-3 receptor DNA were transfected into each 100 mm dish of 293 cells using 
the calcium phosphate method (see Chen & Okayama, 1987, Mol Cell Biol. 2: 2745- 
2752). After transfection, cells were cultured in DMEM media supplemented with 10% 
calf serum in a 3% C0 2 atmosphere at 37°C. Selection was performed with neomycin 
(G418; GIBCO) at a concentration of 1000 ng/ml; selection was started 72 h after 
transfection and continued for 3 weeks. 

Specific binding of melanocortin peptides to cells expressing the MC-3 receptor 
was demonstrated by competition experiments using ,25 l-labeled Nle 4 -D-Phe 7 -a-MSH 
(NDP-MSH, as described in Tatro et aL, 1990, Cancer Res. 5Q: 1237-1242). Suspended 
cells (2x10 s ) were incubated at 37°C with 500,000 cpm of labeled peptide for 1 0 min in 
binding buffer (Ham's F10 media plus lOmM HEPES, pH 7.2, 0.25% bovine serum 
albumin, 500K IU/ml aprotinin, 100 ng/ml bacitracin and ImM 1,10-phenanthroline) 
in the presence or absence of the indicated concentrations of peptides. Maximum 
labeling was achieved within 10 min. 

The results of these experiments are shown in Figure 10. Labeled NDP-MSH 
binding to cells expressing the MC-3 receptor, produced as described above, is inhibited 
by competition with unlabeled peptides known to be melanocortin receptor agonists, 
having a relative order of potency as follows: 
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NDP-MSH > Y-MSH > a-MSH > ACTH^o »> ORG2766. 
Approximate K, values derived from this experiment are as shown in Table I: 



5 



TABLE I 


| Agonist 


" ^(approx) 


NDP-MSH 


2x10"* 


j y-MSH 


5 x 10 8 


a-MSH 


1 x 10- 7 


1 ACTHj_, n 


8 x 10- 5 



cAMP production assays as described above were also used to analyze 
expression of MC3-R in cells transfected with the expression vectors described herein 
as follows. Cells (~5xl0 6 ) were plated in 6-well dishes, washed once with DMEM 

15 containing 1% bovine serum albumin (BSA) and 0.5mM IBMX (a phosphodiesterase 

inhibitor), then incubated for lh at 37°C with varying concentrations of the melanofropic 
peptides aMSH, y 3 MSH, yMSH, the MSH peptide analogues Nle 4 -D-Phe 7 -aMSH 
(NDP-MSH), ACTH^o and ACTH,.„. Following hormone treatment, the cells were 
washed twice with phosphate buffered saline and intracellular cAMP extracted by lysing 

20 the cells with 1ml of 60% ethanoL Intracellular cAMP concentrations were determined 
using an assay which measures the ability of cAMP to displace [8- 3 H] cAMP from a 
high affinity cAMP binding protein (see Gilman, 1979, Proc. Natl Acad. Sci. USA £2: 
305-312). 

The results of these experiments are shown in Figures 1 1 A through 1 1C. The 
25 abscissa indicates the concentration of each hormone and the ordinate indicates the 

percentage of basal intracellular cAMP concentration achieved by each treatment. Points 
indicate the mean of duplicate incubations; the standard error did not exceed 15% for 
any data point. Figure 1 1 A depicts the results of experiments using peptides found in 
vivo; Figure 1 IB depicts results found with y-MSH variants; and Figure 1 1C shows 
30 results of synthetic melanocortin analogues. None of the peptides tested induced any 
change in intracellular cAMP in cells containing the vector alone. Cells expressing rat 
MC3-R responded strongly to every melanotropic peptide containing the MSH sequence 
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His-Phe-Arg-Trp, with" up to a 60-fold elevation of intracellular cAMP levels. EC 50 
values ranged from 1-50 nM. The most potent ligand and the one having the lowest 
EC 50 was found to be yMSH. The order of potency for the naturally occurring 
melanocortins was found to be: 

Y 2 -MSH = yMSH > aMSH = ACTH,. 39 > y 3 -MSH > rfes-acetyl-aMSH > ACTH^o. 
Ec 50 values for these compounds are shown in Table II: 



TABLE II 



Agonist 




NDP-MSH 


1 x 10-' 


Y.-MSH 


3 x 10- 9 


Y 2 -MSH 


3x 10-' 


a-MSH 


4x10-' 


ACTH,.3, 


4x 10-' 


Y3-MSH 


6x 10" 9 


desacetyl-aMSH 


8xl0- 9 


ACTH. .„ 


1 x lO" 7 



Additionally, a synthetic melanocortin peptide (ORG2766), known to have the 
greatest activity in vivo in stimulation of retention of learned behavior and in stimulation 
of neural regeneration, was unable to stimulate MC3-R-mediated cAMP production, and 
was also inactive as an antagonist. The results strongly indicate that this peptide does 
not bind to MC3-R protein. 

The MC-4 receptor was cloned in a 1 .9kb HindHI genomic DNA fragment after 
PCR amplification of a lambda phage clone into pcDNAI/Neo (Invitrogen). This 
plasmid was stably introduced into human 293 cells by calcium phosphate co- 
precipitation using standard techniques, and plasmid-containing cells selected in G41 8 
containing media. Specificity of receptor-hormone binding was assayed using adenylate 
cylcase activity as described above. The MC-4 receptor was found to couple to 
adenylate cyclase activity having the following pattern of agonist affinity: 

NDP-MSH > des-acetyl-a-MSH >/= ACTH,_ 39 >/= a-MSH > > y 2 -MSH = ACTH^o 



-27- 



WO 98/10068 



PCT/US97/15565 



whereas the synthetic ACTH^, analogue ORG2766 showed no detectable binding to the 
MC-4 receptor. The results of adenylate cyclase activity assays are shown in Figure 12. 
ECsq values for each of the tested MC-4 receptor agonists are as shown in Table III: 



TABLE III 



Agonist 


issa 


NDP-MSH 


1.1 x 10"M 


Jasacetyl-ccMSH 


4.9x 10 10 M 


ACTH,.j, 


6.8 x 10 ,0 M 


cc-MSH 


1.5X10-1VI 


Y2-MSH 


> 10" 7 M 


ACTH 4 , 0 .. __ 


>io- 7 



A 1.6kb Apal-Hindlll fragment comprising the entire coding sequence of the 
mouse MC-5 melanocortin receptor disclosed in Example 2G above was cloned into the 
pcDNA/neo expression vector (Invitrogen) after PCR amplification of the lambda phage 
clone. This plasmid was stably introduced into human 293 cells by calcium phosphate 
co-precipitation using standard techniques, and plasmid-containing cells selected in 
G418 containing media. Specificity of receptor-hormone binding was assayed using 
adenylate cylcase activity as described above. The MC-5 receptor was found to couple 
to adenylate cyclase activity having the following pattern of agonist affinity: 

ct-MSH > pMSH > > y-MSH 
The results of adenylate cyclase activity assays are shown in Figure 1 3. EC 50 values for 
each of the tested MC-5 receptor agonists are: a-MSH=1.7 x 10' 9 M; and pMSH = 5 x 
lO'M. 

EXAMPLE 4 

Melanocortin Analogue Binding to Mammalian Melanocortin Receptors 

Recombinant cells prepared as described above in Example 3 were used to 
characterize receptor binding of two melanocortin analogues comprising cyclic lactam 
heptapep tides. 
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The melanocortin receptor analogue SHU9119 has the following chemical 
structure: 



0 




Ac-Nle 4 -cyclo(Asp 5 , D-Nal(2) 7 , Lys'°) aMSH-(4-10)-amide 
The melanocortin receptor analogue MTII has the following chemical structure: 



O 




Ac-Nle 4 -cyclo(Asp 5 , His 6 , D-Phe 7 , Arg 8 , Trp' , Lys'°) ccMSH-(4-1 0)-amide 
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These analogues were prepared as described in Hruby et aL (1 995, J. Med. Chem. 
21: 3454-3461). 

These analogues were tested for melanocortin receptor binding using a 

colorimetric assay system developed by some of the instant inventors (Chen et aL, 1 995, 

5 Analyt. Biochem. 22£t* 349-354) as follows. A series of concatamers of the synthetic 

oligonucleotide: 

5 '-GAATTCG ACGTC AC AGTATGACGGCC ATGG-3 1 
(SEQ ID No: 19) 

10 was produced by self-annealing and ligation and a tandem tetramer obtained. This 
fragment was cloned upstream of a fragment of the human vasoactive intestinal peptide 
(-93-+ 1 52; SEQ ID No.: 1 3; see Fink et aL, 1 988, Proc Natl. Acad. Sci. USA i£: 6662- 
6666). This prompter was then cloned upstream of the P-galactosidase gene from E. 
coli. The resulting plasmid construct is shown in Figure 14. 

15 Transient transfection of the pCRE/p-gal plasmid described above was 

performed as follows. Cells grown to between 40-60% confluency (corresponding to 
about 1.5 million cells/6cm tissue culture plate) were incubated with Opti-MEM 
(GIBCO-BRL, Long Island, NY) and then contacted with a pCRE/p-gal-lipofectin 
complex which was prepared as follows. 3ng plasmid DNA and 2Q\iL lipofectin reagent 

20 (GIBCO) were each diluted into 0.5mL Opti-MEM media and then mixed together. This 
mixture was incubated at room temperature for 15-20 min., and then the mixture (ImL) 
added to each 6cm plate. Transfected plates were incubated at 37 °C for 5-24h, after 
which the plates were washed and incubated with DMEM media (GIBCO) and the cells 
split equally into a 96-well culture plate. 

25 To assay melanocortin receptor analogue binding, human 293 cells expressing 

each of the melanocortin receptors MC-1, MC-3, MC-4 and MC-5, and mouse Yl cells 
expressing the MC-2 receptor, were transiently transfected with pCRE/p-gal as described 
above and assayed as follows. Two days after transfection, cells were stimulated with 
hormones specific for each receptor or hormone analogue by incubation for 6h at 37 °C 

30 with a mixture comprising 1 0* 12 - 1 0"^) hormone or analogue, 0. 1 mg/mL bovine serum 
albumin and O.lmM isobutylmethylxanthine in DMEM. The effect of hormone or 
analogue binding was determined by P-galactosidase assay according to the method of 
Feigner et aL (1994, J. Biol Chem. 262: 2550-2561). Briefly, media was aspirated from 



-30- 



WO 98/10068 PCT/US97/15565 

culture wells and 50^iL lysis buffer (0.25M Tris-HCl, pH 8/0.1% Triton-XlOO) added 
to each well. Cell lysis was enhanced by one round of freezing and thawing the cell/lysis 
buffer mixture. 1 OjiL aliquots were sampled from each well for protein determination 
using a commercially-available assay (BioRad, Hercules, CA). The remaining 40\iL 
5 from each well was diluted with 40fiL phosphate buffered saline/0.5% BSA and 1 50\iL 
substrate buffer (60mM sodium phosphate/ ImM MgCl 2 / lOmM KC1/ 5mM P- 
mercaptoethanol/ 2mg/mL o-nitrophenyl-P-D-galactopyranoside) added. Plates were 
incubated at 37°C for lh and then absorbance at 405nm determined using a 96-well plate 
reader (Molecular Devices, Sunnyvale, CA). A series of two-fold dilutions from 20ng 

10 of purified P-galactosidase protein (Sigma Chemical Co, St. Louis, MO) were assayed 
in parallel in each experiment to enable conversion of OD^ to known quantity of P- 
galactosidase protein. 

The results of these experiments are shown in Figure 15. This Figure shows the 
results of the p-galactosidase assay described above using cells expressing each of the 

1 5 MC- 1 , MC-3, MC-4 or MC-5 receptors and contacted with aMSH or a variety of aMSH 
analogues, including SHU91 19. These results showed that SHU91 19 had relatively 
weak agonist activity for both the human MC-3 and MC-4 receptors. 

These results demonstrated the development of a colorimetric assay for cAMP 
accumulation as the result of melanocortin receptor binding to agonists and antagonists. 

20 The action of MTII, SHU91 19, and the endogenous mouse agouti peptide as 

agonists or antagonists of rodent MC receptors was first determined by examining their 
ability to elevate intracellular cAMP in 293 cell lines expressing the rat MC3-R or 
mouse MC4-R (expressed as IC 50 values representing ligand concentration required for 
half-maximal inhibition of binding of (M25)-(Nle 4 , D-Ph 7 e)a-MSH tracer). 

25 Agonist/antagonist activity was also shown by demonstrating inhibition of cAMP 
elevation by the potent a-MSH analogue [Nle\ D-Phe 7 ]a-MSH, using either a cAMP- 
responsive P-galactosidase reporter construct as described above, or by direct adenyl 
cyclase assay as described in Example 3 (wherein EC 50 values represent ligand 
concentration required for half-maximal activation of a cAMP-responsive P- 

30 galactosidase reporter). Competition binding experiments were determined as the 
amount of radioactivity bound in the presence of 5xlO" 6 M unlabeled [Nle 4 , D-Phe 7 ]a- 
MSH, and was typically 3-5% of total counts bound. 
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In these experiments, murine agouti peptide was produced using a baculovirus 
system as described by Lu et al (1994, Nature 371: 799-802), with the modification that 
the agoutipepXidz was purified from baculovirus supematants by 0.6M NaCl step elution 
from an EconoS cation exchange column (BioRad). Agouti peptide used in these assays 
5 was approximately 60% pure. 

Competition binding assays were performed to determine whether SHU91 1 9 had 
antagonist activity towards aMSH binding to either the MC-3 or MC-4 receptors. These 
assays were performed as follows. Human 293 cells (100,000 cells/well in 24-well 
plates) expressing either the MC-3 or MC-4 receptors prepared as described above were 

10 incubated with a solution of lmg/mL BS A in PBS containing 1 00,000cpm (3.1x1 0~'°M 
[ ,25 I](Nle 4 , D-Phe 7 )ccMSH and varying concentrations of aMSH, (Nle\ D-Phe 7 )aMSH 
or SHU91 19. Cells were incubated for 30min at 37°C, washed twice with PBS-BSA, 
lysed with 0.5mL 0.5N NaOH, and counted using a y -counter to quantitate the amount 
of bound [ l25 I](Nle 4 , D-Phe 7 )aMSH. Control experiments showed non-specific binding 

15 to occur at about 3-5% levels, and this was taken into account when analyzing the 

experimental results. 

The SHU91 19 analogue was found to be a potent antagonist of both the human 
MC-3 and MC-4 receptors, as shown in Figure 16. These assays showed pA 2 values of 
8.3 and 9.3 for the human MC-3 and MC-4 receptors, respectively, as determined using 

20 the method of Schild (1 947, Brit. J. Pharmacol 2: 1 89-206). In contrast, no significant 
alteration in IC 50 values was detected in binding experiments using this analogue with 
either the MC-3 or MC-4 receptors (Figure 17). 

The activity of the MTII analogue was also assayed for melanocortin receptor 
agonist activity. These results are shown in Figures 18A and 18B, and confirmed that 

25 the MTII analogue is a specific agonist of the MC-3 and MC-4 receptors. 

Specific competition of [Nle\D-Phe 7 ]a-MSH binding to rat MC-3 receptor by 
agouti peptide was observed, although accurate IC 50 values could not be determined 
because the peptide preparation was not homogenous (results not shown). Specific 
competition of a-MSH activation of human MC4-R by agouti was previously disclosed 

30 (Lu et a/., 1994, Nature 221: 799-802). 
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EXAMPLE 5 

Feeding Behavior Effect of Melanoeortin Analogue Binding in Brain 

The results shown in Example 4 above suggested a role in the regulation of 
feeding behavior in mammalian brain for MC receptor agonists and antagonists, in view 
of the antagonist properties of the agouti peptide at the MC-3 and MC-4 receptors. The 
agouti peptide was known to cause obesity when expressed ectopically in the mouse, and 
has been found to be a high affinity antagonist of the melanocyte stimulating hormone 
receptor (MC1-R) and of the hypothalamic MC-4 receptor (see Lu et aL, ibid,). The 
former activity explained the inhibitory effect of the agouti peptide on eumelanin 
pigment synthesis. Similarly, it was hypothesized by the inventors that agouti causes 
obesity in mice by antagonizing hypothalamic MC-4 receptors. The cyclic melanocortin 
analogue, SHU91 19, having been shown herein and elsewhere (Hruby et al) to be a 
specific, high affinity antagonist of the central MC-3 and MC-4 receptors, was tested for 
the effect of direct administration to mouse brain on feeding behavior in the animals. 
Intracerebroventricular (ICV) administration of SHU91 19 was performed to avoid any 
complications caused by inhibition of peptide traverse of the blood-brain barrier. 

Briefly, male C57B1/6J mice (1 8-29g) were maintained on a normal 12hr/12hr 
light dark cycle with food (Purina mouse chow) and water ad libitum. Animals were 
housed individually for 24 hrs, distributed into experimental and control groups, 
avoiding any bias as a function of prior weight, then injected with vehicle or vehicle plus 
drug just prior to the onset of a 12hr light or dark cycle. Fasted animals were deprived 
of food from 18:00 to 10:30 hrs to stimulate feeding during the daytime experimental 
period. Animals were lightly anesthetized with halothane, and administered into one 
lateral ventricle 2 fiL of a solution of artificial cerebrospinal fluid alone (acsf, 
comprising 130mM NaCl, 27mM NaHC0 3 , 1.2mM Njt HPQ , 0.3mM Na^I Pp , 
0.5mM Na^O,, 1 .OmM CaCl 2 , 1 .OmM MgCl 2 , and 2.5mM KC1), or 6nmo! SHU91 1 9 
in acsf. Freehand injections were performed as described by Laursen and Belknap 
(1986, J. Pharmacol. Methods 355-357) with some modifications. A lOfil luertip 
syringe (Hamilton 701LT) was fitted with a 0.5 inch 27 gauge needle. Stiff tygon tubing 
was slipped over the needle to expose 3mM of the needle tip. The syringe was held at 
a 45 ° angle from the front of the skull with the bevel facing up. The coronal suture was 
found by lightly rubbing the needle over the skull. Maintaining the 45 0 angle, the needle 
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was then inserted l-2mm lateral to the midline, using only mild pressure to insert and 
remove the needle. The compounds indicated in a 2^1 volume of acsf were administered 
slowly over approximately 15 seconds, and the needle removed after 35 seconds. 
Animals were allowed to recover from anesthesia and placed into a cage containing a 
5 premeasured quantity of food pellets in a spill-free cup. Moribund animals were not 

included in the study. 

Stimulation of feeding by intracerebroventricular administration of the 
melanocortin antagonist SHU91 19 is shown in Figures 19A through 19C. Curves show 
cumulative food intake as a function of time following administration of the substances 

1 0 shown. Figure 1 9A shows stimulation of feeding by administration of SHU9 1 1 9 just 
prior to lights ofT(19:00 hrs) to C57B1/6J mice fed ad libitum. Figure 19B, in contrast, 
shows no effect of morning (10:00 hrs) SHU91 19 administration in C57B1/6J mice fed 
ad libitum. Figure 19C illustrates stimulation of daytime feeding by SHU9119 
administration in fasted C57B1/6J mice. In deriving the data points comprising these 

15 Figures, food remaining was briefly removed and weighted at the time intervals 

indicated. Data points indicate the mean and bars indicate standard error. Significance 
of the effect over time was determined by ANOVA with repeated measures. 
Significance of drug effects at individual time points was determined by two-way 
ANOVA, and is indicated in each Figure (***=P<0.001, **=P<0.01, *=P<0.05). 

20 These results demonstrated that ICV administration of SHU9 1 1 9 into one lateral 

ventricle of the C57B1/6J mouse just prior to lights out led to a mean 60% increase in 
food intake over 12 hrs (Figure 19A; PO.005). In contrast, daytime food intake in 
animals fed ad libitum was not stimulated by administration of SHU91 19 (Figure 19B). 
SHU91 19-treatment did, however, significantly stimulate daytime food intake in animals 

25 fasted for 16 hrs prior to the experiment (Figure 19C; PO.001 ). Stimulation of feeding 

was evident at approximately two hrs post-treatment, and continued for 12 hrs, to 
produce a mean 34% in food intake relative to vehicle-injected controls. 

These results supported the hypothesis that agouti and/or SHU91 19 stimulate 
feeding by antagonizing MC receptors in the central nervous system. To further test this 

30 hypothesis, a series of experiments were performed wherein MC receptor agonists were 
administered to animals primed by fasting to eat, to determine whether feeding in such 
animals could be inhibited by the MC receptor agonists. Animals were induced to feed 
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by food deprivation for 16h prior to ICV administration of the non-specific melanocortin 
agonist MTU In these experiments, ICV injections in male C57B1/6J mice (20-30g) 
and the measurement of food intake were performed as described above. 

Results of these experiments are shown in Figures 20A through 20C. In 
comparison to vehicle-injected animals, MTII was found to produce a potent inhibition 
of feeding within one hour after administration (Figure 20A) in a dose-responsive 
manner. Food intake was significantly inhibited for up to four hours following 
administration (PO.001) at the highest dose administered (3nmol), and decreased food 
intake continued for the next four hours with normal rates of food intake resuming at 
about 8 hours after treatment. This dose-responsive inhibition of feeding had an IC 50 at 
the two hour time point of approximately O.Snmol (Figure 20B). However, inhibition 
of feeding with 3nmol MTII was completely blocked by co-administration of 6nmol 
SHU91 19 (Figure 20C; P<0.001), demonstrating that the effect results specifically from 
agonist binding to the MC-4 and/or MC-3 receptor. 

Locomotor assays were performed to determine whether the effects on feeding 
behavior observed in these mice were secondary to generalized behavioral effects caused 
by administration of these melanocortin analogues. The effects of MTII on locomotor 
activity were tested by placing vehicle or MTII-treated mice in sound and light-proof 
cages containing multiple light beam detectors. These assays were performed by first 
injecting 3nmol MTII or acsf as described above. At three hours (2:45-3:25) post- 
injection, 12 mice were placed into 12 separate boxes containing multiple infrared light 
sources and photodetectors. The boxes were contained within separate ventilated light 
and sound attenuating chambers (Coulbourn model El 0-20). Disruption of the infrared 
beams, with a 10msec resolution, was tallied independently for each one minute time 
period in each cage. The results of these assays are shown in Figure 20D. Data points 
indicate the mean total activity (# of light breaks) for 6 animals in each experimental 
group. Four way ANOVA statistical analysis was used to analyze the data, and indicated 
an absence of a significant difference among the two groups. 

Inhibition of feeding by MTII could not b6 explained by any apparent behavioral 
abnormalities, or any effect on arousal or locomotor activity. MTII-treated animals 
appeared alert and exhibited no unusual behavior relative to controls. At approximately 
three hours after ICV administration, MTII-treated animals exhibited locomotor activity 
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that was indistinguishable from vehicle-treated animals (Figure 20D). The higher initial 
activity, indicative of exploratory behavior, and continued locomotion over a 15 min 
period was indistinguishable between the two groups, indicating that the inhibition of 
feeding was not due to decreased locomotion or decreased arousal. 
5 The administration of MTII also inhibited food intake in three other models of 

hyperphagia: the C57B\/65-Lep ob mouse, a neuropeptide Y (NPY)-injected C57B1/6J 
mouse and a C5TBl/6J-A y mouse. Figure 21A shows inhibition of feeding by 
intracerebroventricular administration of MTU in A Y mice (females, 1 9-28gms). Figure 
2 IB shows inhibition of feeding by intracerebroventricular administration of MTII in 

10 C57B1/6J mice (females, 21-25gm) stimulated to feed by co-administration of NPY. 

Figure 21 C shows inhibition of feeding by intracerebroventricular administration of the 
MTII in ob/ob mice (females, 48-69 gins). Figure 2 ID shows inhibition of feeding in 
ob/ob mice by intraperitoneal administration of MTII (females, 40-45 gnis). ICV 
injections and measurement of food intake were perfoimed as described above, with the 

15 exception of NPY treated animals, which were not fasted prior to experimentation. 

Animals treated intraperitoneal ly received lOOfxl of a ImM solution of MTII in saline, 
and vehicle injections consisted of the same amount of saline alone. Significance 
indicated for individual time points, determined as described above, was for 3nmol MTII 
vs. acsf (Figure 21 A), 1.18 nmol NPY vs. 1.18 nmol NPY + 3 nmol MTII (Figure 21B), 

20 3nmol MTII vs. acsf (Figure 2 1 C), and 1 00 nmol MTII vs. saline (Figure 2 1 D). 

The hyperphagia in these models can be clearly seen by comparing the 1 2 hr food 
intake following a fast in vehicle-injected C57B1/6J (2.4g, Figure 19A), C57Bl/6J-^ r 
(3.7g, Figure 21 A) and 05761/61-1^ (3.7g. Figure 2 1C) animals. As expected, MTII 
treatment inhibited food intake following a 16 hr fast in the C57B 1/6 hA Y mouse (Figure 

25 2 1 A; P<0.05). Interestingly, while food intake for the first four hours is significantly 

inhibited relative to vehicle-injected animals, it is also significantly less inhibited in the 
C57Bl/6J-i4 y animal than in the C57B1/6J animal given the same 3nmol dose (compare, 
Figure 20A versus Figure 21 A, 1-4 hrs; PO.001). The decreased effectiveness of the 
agonist in the presence of the A Y allele is consistent with the proposal that this allele 

30 results in chronic expression of agouti peptide melanocortin antagonist in the brain. 

MTII, upon co-administration, also significantly inhibited the profound 
stimulation of feeding induced by NPY, measured over a three hr period (Figure 21C; 
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PO.005). Co-administration of an approximately two-fold molar excess of MTII 
produced a 74% inhibition of NPY-stimulated food intake at the three hour time point. 

Finally, MTII also inhibited hyperphagia due to absence of leptin in the 
C57BV6J-Lep ob mouse (Figure 21C; PO.001). MTII potently blocked feeding (Figure 
20A) in these animals, in contrast to the less potent inhibition described above for the 
C57Bl/6J-^ ) ' mouse. 

The C57B1/6J-Z^* animal was also used to test the ability of MTII to regulate 
feeding when administered peripherally. Moderate doses (lOOnmol) of MTII inhibited 
feeding in the C57B1/6J-Z,ep°* mouse (PO.001) while low doses (lOnmol) did not (date 
not shown). The kinetics were similar to those seen with ICV administration, with a 
potent inhibition of feeding for the first four hours. The 1 00-fold higher dose required 
peripherally, as well as the similar kinetics, suggest a primarily central nervous system 
-based mechanism of action of MTII. 

These data show that melanocortinergic neurons exert a tonic inhibition of 
feeding behavior, and that disruption of this signal leads to hyperphagia. With regard 
to the recently-discovered leptin hoimone made by adipocytes, which is generally 
expressed at elevated levels in obese humans and rodents (such as the C57Bl/6J-JLe/?° 6 
animal), the regulatory defect is understood to be an incapacity to respond properly to 
the leptin hormone signal. The instant results indicate that the melanocortins act 
independently, and physiologically "downstream," from the leptin hormone/receptor 
interaction, because it has been shown herein that melanocortin receptor agonists can 
potently inhibit feeding in the C57B1/6J-Z,ep° 6 animal. 

These results suggest that MC receptor agonists and antagonists can affect 
mammalian feeding behavior, and provide a means for determining candidate 
compounds for the development of effective pharmacological products directed towards 
alleviating such human ailments as obesity, anorexia and cachexia. 

EXAMPLE 6 

Use of MC Receptor-Expressing Recombinant Cells for Screening Compounds 
that Affect Feeding Behavior in Mammals 

The results obtained in Example 5 indicated that cells expressing a variety of 

mammalian melanocortin receptors are useful for characterizing compounds as a first 
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step towards developing MC receptor agonists and antagonists for controlling feeding 
behavior in mammals, particularly obesity and overweight disorders in general, as well 
as anorexia, cachexia and other failure-to-thrive disorders. 

A panel of mammalian melanocortin receptor-expressing recombinant cells are 
5 provided as described above in Example 3, wherein each member of the panel comprises 
appropriate mammalian cells, such as human 293 cells, comprising a recombinant 
expression construct encoding the MC-1, MC-2 (ACTH), MC-3, MC-4 or MC-5 
receptor, the panel constructed to comprise cells functionally expressing each of these 
MC receptor proteins. 

10 The panel is used as follows. Receptor agonist activity is assayed by transient 

or stable expression of a protein which produces a metabolite reporter molecule in 
response to receptor binding by any of the MC receptor proteins. An example of such 
a reporter system is the recombinant expression construct described in Example 4, 
wherein cAMP responsive elements (CREs) are operatively linked to a bacterially- 

15 derived P-galactosidase (P-gal) gene. In the event of receptor binding, cAMP is 

produced in the mammalian cell, and the CRE induces (J-gal expression. When co- 
incubated with a colorless substrate for P-gal, receptor binding results in conversion of 
the colorless substrate to a blue-colored product, which can be easily scored visually or 
spectrophotometrically. Alternative reporter genes, such a luciferase, can also be used 

20 as reporter systems, provided that expression of the reporter molecule-producing protein 
is functionally linked to receptor binding of a test compound. Alternatively, cAMP 
production resulting from MC receptor binding can also be measured directly. 

Assay panels are arranged so that agonist activity can be identified, quantitated 
and correlated with expression of each MC receptor. Automated assays using such 

25 panels are also envisioned, whereby the qualitative and quantitative detection of a 

reporter metabolite is detected in an array (such as a 96-well tissue culture plate) and the 
data collected and assembled into a computer data-base or other analytical program. 

Antagonist activity is detected by a modification of the above assay. In this 
assay, the inhibition of cAMP production by a standardized amount of a known receptor 

30 agonist, specific for each receptor, is assayed in the presence of a putative antagonist 

compound. Production of metabolite reporter molecules and their qualitative and 
quantitative detection is achieved as described above, and the specificity and potency of 
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each antagonist compound characterized with regard to the degree of inhibition achieved 
for each receptor. 

In view of the instant disclosure, MC-3/MC-4 receptor antagonists are expected 
to be useful to inhibit food intake in a hungry animal, and MC-3/MC-4 receptor agonists 
are expected to be useful to increase food intake in an animal. Alternative patterns of 
feeding behavior associated with different patterns of MC receptor agonist/antagonist 
activity can be determined using this assay. 

Compounds having agonist or antagonist activity with the MC-3 or MC-4 
receptors detected using this assay are further screened in vivo to determine whether the 
observed receptor binding activity results in modification of feeding behavior when 
administered to an animal. In these assays, the MC receptor binding compounds 
detected using the assay are administered intracranioventricularly as described above in 
Example 5 to animals after an overnight fast, to waking animals, or to animals that are 
not otherwise primed to be hungiy. Feeding and locomotor activity is monitored in these 
animals, and compounds affecting eating behavior (either by inhibiting feeding in 
otherwise hungry animals or stimulating feeding in otherwise sated animals) are selected 
for further development. 

In addition, systemic administration of compounds found to be active by ICV 
administration assays is used to screen such compounds for the ability to cross the blood- 
brain barrier. Such compounds are also useful as templates for modifications aimed at 
increasing the availability of these compounds in the brain after systemic administration, 
for increasing bioactivity, or both. 

It should be understood that the foregoing disclosure emphasizes certain specific 
embodiments of the invention and that all modifications or alternatives equivalent 
thereto are within the spirit and scope of the invention as set forth in the appended 
claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Oregon Health Sciences University 

(B) STREET: 3181 S.W. Sam Jackson Park Road 

(C) CITY: Portland 

(D) STATE: Oregon 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 97201 

(G) TELEPHONE: 503-494-8200 

(H) TELEFAX: 503-494-4729 

(ii) TITLE OF INVENTION: Methods and Reagents for Discovering and 

Using Mammalian Melanocortin Receptor Agonists and Antagonists 
To Modulate Feeding Behavior in Animals 

(iii) NUMBER OF SEQUENCES: 19 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(v) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY : mics_feature 

(B) LOCATION: 1..35 

(D) OTHER INFORMATION: /function = Regenerate 
oligonucleotide primer (sense)" 
/note- "The residue at positions 24 and 24 are 
inosine" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GAGTCGACCT GTGYGYSATY RCNNTKGACM GSTAC 



PCT/US97/15565 



35 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: mics_f eature 

(B) LOCATION: 1..32 

(D) OTHER INFORMATION: /function = "Degenerate 
oligonucleotide primer (antisense)" 
/note= n The residue at position 18 is inosine" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

CAGAATTCAG WAGGGCANCC AGCAGASRYG AA 32 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(ix) FEATURE: 

(A) NAME /KEY: 5'UTR 

(B) LOCATION: 1 . . 14 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 15.. 959 

(ix) FEATURE: 

(A) NAME/ KEY : 3 ' UTR 

(B) LOCATION: 960.. 1260 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TTCCTGACAA GACT ATG TCC ACT CAG GAG CCC CAG AAG AGT CTT CTG GGT 50 
Met Ser Thr Gin Glu Pro Gin Lys Ser Leu Leu Gly 
1 5 10 

TCT CTC AAC TCC AAT GCC ACC TCT CAC CTT GGA CTG GCC ACC AAC CAG 98 
Ser Leu Asn Ser Asn Ala Thr Ser His Leu Gly Leu Ala Thr Asn Gin 
15 20 " 25 

TCA GAG CCT TGG TGC CTG TAT GTG TCC ATC CCA GAT GGC CTC TTC CTC 146 
Ser Glu Pro Trp Cys Leu Tyr Val Ser lie Pro Asp Gly Leu Phe Leu 
30 35 40 

AGC CTA GGG CTG GTG AGT CTG GTG GAG AAT GTG CTG GTT GTG ATA GCC 194 
Ser Leu Gly Leu Val Ser Leu Val Glu Asn Val Leu Val Val lie Ala 
45 50 55 60 

ATC ACC AAA AAC CGC AAC CTG CAC TCG CCC ATG TAT TAC TTC ATC TGC 242 
lie Thr Lys Asn Arg Asn Leu His Ser Pro Met Tyr Tyr Phe lie Cys 
65 70 75 

TGC CTG GCC CTG TCT GAC CTG ATG GTA AGT GTC AGC ATC GTG CTG GAG 2 90 

Cys Leu Ala Leu Ser Asp Leu Met Val Ser Val Ser lie Val Leu Glu 
80 85 90 

ACT ACT ATC ATC CTG CTG CTG GAG GTG GGC ATC CTG GTG GCC AGA GTG 338 
Thr Thr lie lie Leu Leu Leu Glu Val Gly lie Leu Val Ala Arg Val 
95 100 105 

GCT TTG GTG CAG CAG CTG GAC AAC CTC ATT GAC GTG CTC ATC TGT GGC 386 
Ala Leu Val Gin Gin Leu Asp Asn Leu lie Asp Val Leu lie Cys Gly 
110 115 120 

TCC ATG GTG TCC AGT CTC TGC TTC CTG GGC ATC ATT GCT ATA GAC CGC 434 
Ser Met Val Ser Ser Leu Cys Phe Leu Gly lie lie Ala lie Asp Arg 
125 130 135 140 



TAC ATC TCC ATC TTC TAT GCG CTG CGT TAT CAC AGC ATC GTG ACG CTG 4 82 

Tyr lie Ser lie Phe Tyr Ala Leu Arg Tyr His Ser lie Val Thr Leu 
145 150 155 

CCC AGA GCA CGA CGG GCT GTC GTG GGC ATC TGG ATG GTC AGC ATC GTC 530 
Pro Arg Ala Arg Arg Ala Val Val Gly lie Trp Met Val Ser lie Val 
160 165 170 

TCC AGC ACC CTC TTT ATC ACC TAC TAC AAG CAC ACA GCC GTT CTG CTC 578 
Ser Ser Thr Leu Phe lie Thr Tyr Tyr Lys His Thr Ala Val Leu Leu 
175 180 : 185 
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TGC CTC GTC ACT TTC TTT CTA GCC ATG CTG GCA CTC ATG GCG ATT CTG 626 
Cys Leu Val Thr Phe Phe Leu Ala Met Leu Ala Leu Met Ala lie Leu 
190 195 200 

TAT GCC CAC ATG TTC ACG AGA GCG TGC CAG CAC GTC CAG GGC ATT GCC 674 
Tyr Ala His Met Phe Thr Arg Ala Cys Gin His Val Gin Gly He Ala 
205 210 215 220 

CAG CTC CAC AAA AGG CGG CGG TCC ATC CGC CAA GGC TTC TGC CTC AAG 722 
Gin Leu His Lys Arg Arg Arg Ser He Arg Gin Gly Phe Cys Leu Lys 
225 230 235 

GGT GCT GCC ACC CTT ACT ATC CTT CTG GGG ATT TTC TTC CTG TGC TGG 770 
Gly Ala Ala Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cys Trp 
240 245 250 

GGC CCC TTC TTC CTG CAT CTC TTG CTC ATC GTC CTC TGC CCT CAG CAC 818 
Gly Pro Phe Phe Leu His Leu Leu Leu He Val Leu Cys Pro Gin His 
255 260 265 

CCC ACC TGC AGC TGC ATC TTC AAG AAC TTC AAC CTC TTC CTC CTC CTC 866 
Pro Thr Cys Ser Cys He Phe Lys Asn Phe Asn Leu Phe Leu Leu Leu 
270 275 280 

ATC GTC CTC AGC TCC ACT GTT GAC CCC CTC ATC TAT GCT TTC CGC AGC 914 
He Val Leu Ser Ser Thr Val Asp Pro Leu He Tyr Ala Phe Arg Ser 
285 290 295 300 

CAG GAG CTC CGC ATG ACA CTC AAG GAG GTG CTG CTG TGC TCC TGG 959 
Gin Glu Leu Arg Met Thr Leu Lys Glu Val Leu Leu Cys Ser Trp 
305 310 315 

TGATCAGAGG GCGCTGGGCA GAGGGTGACA GTGATATCCA GTGGCCTGCA TCTGTGAGAC 1019 

CACAGGTACT CATCCCTTCC TGATCTCCAT . TTGTCTAAGG GTCGACAGGA TGAGCTTTAA 1079 

AATAGAAACC C AGAGTG CCT GGGGCCAGGA GAAAGGGTAA CTGTGACTGC AGGGCTCACC 1139 

CAGGGCAGCT ACGGGAAGTG GAGGAGACAG GGATGGGAAC TCTAGCCCTG AGCAAGGGTC 1199 

AGACCACAGG CTCCTGAAGA GCTTCACCTC TCCCCACCTA CAGGCAACTC CTGCTCAAGC 1259 

C 1260 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ser Thr Gin Glu Pro Gin Lys Ser Leu Leu Gly Ser Leu Asn Ser 
15 10 is 

Asn Ala Thr Ser His Leu Gly Leu Ala Thr Asn Gin Ser Glu Pro Trp 
20 25 30 

Cys Leu Tyr Val Ser He Pro Asp Gly Leu Phe Leu Ser Leu Gly Leu 

35 40 45 ' • 

Val Ser Leu Val Glu Asn Val Leu Val Val lie Ala He Thr Lys Asn 
50 55 €0 

Arg Asn Leu His Ser Pro Met Tyr Tyr Phe He Cys Cys Leu Ala Leu 
65 70 75 80 

Ser Asp Leu Met Val Ser Val Ser He Val Leu Glu Thr Thr lie He 
85 90 95 

Leu Leu Leu Glu Val Gly He Leu Val Ala Arg Val Ala Leu Val Gin 
100 105 110 

Gin Leu Asp Asn Leu He Asp Val Leu He Cys Gly Ser Met Val Ser 
115 120 125 

Ser Leu Cys Phe Leu Gly He He Ala He Asp Arg Tyr lie Ser He 
130 135 140 

Phe Tyr Ala Leu Arg Tyr His Ser lie Val Thr Leu Pro Arg Ala Arg 
145 150 155 160 

Arg Ala Val Val Gly lie Trp Met Val Ser He Val Ser Ser Thr Leu 
165 170 175 

Phe He Thr Tyr Tyr Lys His Thr Ala Val Leu Leu Cys Leu Val Thr 
180 185 190 

Phe Phe Leu Ala Met Leu Ala Leu Met Ala lie Leu Tyr Ala His Met 
195 200 205 

Phe Thr Arg Ala Cys Gin His Val Gin Gly lie Ala Gin Leu His Lys 
210 215 220 

Arg Arg Arg Ser lie Arg Gin Gly Phe Cys Leu Lys Gly Ala Ala Thr 
225 230 235 240 

Leu Thr lie Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro Phe Phe 
245 250 255 
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Leu His Leu Leu Leu He Val Leu Cys Pro Gin Hie Pro Thr Cys Ser 
260 265 270 

Cys He Phe Lys Asn Phe Asn Leu Phe Leu Leu Leu He Val Leu Ser 
275 280 285 

Ser Thr Val Asp Pro Leu He Tyr Ala Phe Arg Ser Gin Glu Leu Arg 
290 295 300 

Met Thr Leu Lys Glu Val Leu Leu Cys Ser Trp 
305 310 315 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(ix) FEATURE: 

(A) NAME /KEY: 5 ' UTR 

(B) LOCATION: 1..461 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 462.. 1415 

(ix) FEATURE: 

(A) NAME/KEY: 3 1 UTR 

(B) LOCATION: 1416.. 1633 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCCGCATGTG GCCGCCCTCA ATGGAGGGCT CTGAGAACGA CTTTTAAAAC GCAGAGAAAA 60 

AGCTCCATTC TTCCCAGACC TCAGCGCAGC CCTGGCCCAG GAAGGGAGGA GACAGAGGCC 120 

AGGACGGTCC AGAGGTGTCG AAATGTCCTG GGAACCTGAG CAGCAGCCAC CAGGGAAGAG 18 0 

GCAGGGAGGG AGCTGAGGAC CAGGCTTGGT TGTGAGAATC CCTGAGCCCA GGCGGTTGAT 24 0 

GCCAGGAGGT GTCTGGACTG GCTGGGCCAT GCCTGGGCTG ACCTGTCCAG CCAGGGAGAG 300 
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GGTGTGAGGG CAGATCTGGG GGTGCCCAGA TGGAAGGAGG CAGGCATGGG GACACCCAAG 360 

GCCCCCTGGC AGCACCATGA ACTAAGCAGG ACACCTGGAG GGGAAGAACT GTGGGGACCT 420 

GGAGGCCTCC AACGACTCCT TCCTGCTTCC TGGACAGGAC T ATG GCT GTG CAG 4 73 

Met Ala Val Gin 
1 

GGA TCC CAG AGA AGA CTT CTG GGC TCC CTC AAC TCC" ACC CCC ACA GCC 521 
Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Asn Ser Thr Pro Thr Ala 
5 10 15 20 

ATC CCC CAG CTG GGG CTG GCT GCC AAC CAG ACA GGA GCC CGG TGC CTG 569 
lie Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly Ala Arg Cys Leu 
25 30 35 

GAG GTG TCC ATC TCT GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG AGC 617 
Glu Val Ser lie Ser Asp Gly Leu Phe Leu Ser Leu Gly Leu Val Ser 
40 45 50 

TTG GTG GAG AAC GCG CTG GTG GTG GCC ACC ATC GCC AAG AAC CGG AAC 665 
Leu Val Glu Asn Ala Leu Val Val Ala Thr lie Ala Lys Asn Arg Asn 
55 60 65 

CTG CAC TCA CCC .ATG TAC TGC TTC ATC TGC TGC CTG GCC TTG TCG GAC 713 
Leu His Ser Pro Met Tyr Cys Phe lie Cys Cys Leu Ala Leu Ser Asp 
70 75 80 

CTG CTG GTG AGC GGG ACG AAC GTG CTG GAG ACG GCC GTC ATC CTC CTG 761 
Leu Leu Val Ser Gly Thr Asn Val Leu Glu Thr Ala Val He Leu Leu 
85 90 95 100 

CTG GAG GCC GGT GCA CTG GTG GCC CGG GCT GCG GTG CTG CAG CAG CTG BO 9 

Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val Leu Gin Gin Leu 
105 110 115 

GAC AAT GTC ATT GAC GTG ATC ACC TGC AGC TCC ATG CTG TCC AGC CTC 857 
Asp Asn Val He Asp Val He Thr Cys Ser Ser Met Leu Ser Ser Leu 
120 125 130 

TGC TTC CTG GGC GCC ATC GCC GTG GAC CGC TAC ATC TCC ATC TTC TAC 905 
Cys Phe Leu Gly Ala He Ala Val Asp Arg Tyr He Ser He Phe Tyr 
135 140 145 

GCA CTG CGC TAC CAC AGC ATC GTG ACC CTG CCG CGG GCG CCG CGA GCC 953 
Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg Ala Pro Arg Ala 
150 155 160 

GTT GCG GCC ATC TGG GTG GCC AGT GTC GTC TTC AGC ACG CTC TTC ATC 1001 
Val Ala Ala lie Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe He 
165 170 175 180 
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GGC TAC TAC GAC CAC GTG GCC GTC CTG CTG TGC CTC GTG GTC TTC TTC 104 9 

Gly Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu Val Val Phe Phe 
185 190 195 

CTG GCT ATG CTG GTG CTC ATG GCC GTG CTG GAC GTC CAC ATG CTG GCC 1097 
Leu Ala Met "Leu Val Leu Met Ala Val Leu Asp Val His Met Leu Ala 
200 205 210 

CGG GCC TGC CAG CAC GCC CAG GGC ATC GCC CGG CTC CAC AAG AGG CAG 1145 
Arg Ala Cys Gin His Ala Gin Gly lie Ala Arg Leu His Lys Arg Gin 
215 220 225 

CGC CCG GTC CAC CAG GGC TTT GGC CTT AAA GGC GCT GTC ACC CTC ACC 1193 
Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala Val Thr Leu Thr 
230 235 240 

ATC CTG CTG GGC ATT TTC TTC CTC TGC TGG GGC CCC TTC TTC CTG CAT 1241 
lie Leu Leu Gly lie Phe Phe Leu Cys Trp Gly Pro Phe Phe Leu His 
245 250 255 260 

CTC ACA CTC ATC GTC CTC TGC CCC GAG CAC CCC ACG TGC GGC TGC ATC 12 B 9 

Leu Thr Leu He Val Leu Cys Pro Glu His Pro Thr Cys Gly Cys He 
265 270 275 

TTC AAG AAC TTC AAC CTC TTT CTC GCC CTC ATC ATC TGC AAT GCC ATC 1337 
Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu He He Cys Asn Ala He 
280 285 290 

ATC GAC CCC CTC ATC TAC GCC TTC CAC AGC CAG GAG CTC CGC AGG ACG 1385 
He Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu Leu Arg Arg Thr 
295 300 305 

CTC AAG GAG GTG CTG ACA TGC TCC TGG TGA GCGCGGTGCA CGCGCTTTAA 1435 
Leu Lys Glu Val Leu Thr Cys Ser Trp * 
310 315 

GTGTGCTGGG CAGAGGGAGG TGGTGATATT GTGGTCTGGT TCCTGTGTGA CCCTGGGCAG 14 95 

TTCCTTACCT CCCTGGTCCC CGTTTGTCAA AGAGGATGGA CTAAATGATC TCTGAAAGTG 1555 

TTGAAGCGCG GACCCTTCTG GGCAGGGAGG GGTCCTGCAA AACTCCAGGC AGGACTTCTC 1615 

ACCAGCAGTC GTGGGAAC 1633 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Val Gin Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Asn Ser 
15 10 15 

Thr Pro Thr Ala lie Pro Gin Leu Gly Leu Ala Ala Asn Gin Thr Gly 
20 25 30 

Ala Arg Cys Leu Glu Val Ser He Ser Asp Gly Leu Phe Leu Ser Leu 
35 40 45 

Gly Leu Val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr He Ala 
50 55 60 

Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe He Cys Cys Leu 
65 70 75 80 

Ala Leu Ser Asp Leu Leu Val Ser Gly Thr Asn Val Leu Glu Thr Ala 
85 90 95 

Val He Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val 
100 105 110 

Leu Gin Gin Leu Asp Asn Val He Asp Val He Thr Cys Ser Ser Met 
115 120 125 

Leu Ser Ser Leu Cys Phe Leu Gly Ala lie Ala Val Asp Arg Tyr He 
130 135 140 

Ser He Phe Tyr Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg 
145 150 155 160 

Ala Pro Arg Ala Val Ala Ala He Trp Val Ala Ser Val Val Phe Ser 
165 170 175 

Thr Leu Phe He Gly Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu 
180 185 190 

Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Asp Val 
195 200 205 

His Met Leu Ala Arg Ala Cys Gin His Ala Gin Gly He Ala Arg Leu 
210 215 220 

His Lys Arg Gin Arg Pro Val His Gin Gly Phe Gly Leu Lys Gly Ala 
225 230 235 240 

Val Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cys Trp Gly Pro 
245 250 255 
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Phe Phe Leu His Leu Thr lieu 'He 
260 

Cys Gly Cys He Phe Lys Asn Phe 
275 280 

Cys Asn Ala He He Asp Pro Leu 
290 295 

Leu Arg Arg Thr Leu Lys Qlu Val 
305 310 



Val Leu Cys Pro Glu His Pro Thr 
265 270 

Asn Leu Phe Leu Ala Leu He He 
285 

He Tyr Ala Phe His Ser Gin Glu 
300 

Leu Thr Cys Ser Trp * 
315 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2012 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(ix) FEATURE: 

(A) NAME/ KEY : 5'UTR 

(B) LOCATION: 1..693 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 694.. 1587 

( ix ) FEATURE : 

(A) NAME /KEY : 3'UTR 

(B) LOCATION: 1588.. 2012 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACAACACTTT ATATATATTT TTATAAATGT AAGGGGTACA AAGGTGCCAT TTTGTTACAT 60 

GGATATACCG TGTAGTGGTG AAGCCTGGGC TTTTAGTGTA TCTGTCATCA GAATAACATA 120 

CGTGTTACCC ATAGGAATTT CTCATCACCC GCCCCCTCCA CCCTTCGAGT CTCCAATGTC 180 

CATTCCACAC TCTATATCCA CGTGTATGCA TATAGCTCCA CATATAAGTG AGAACATGTA 240 

GTATTTGACT TCCTCTTTCT GAGTTATTTC ACTTTGATAA TGGCCTCCAC TTCCATCCAT 300 

GTTGCTGCAA AAGACATGAC CTTATTCTTT TTGATAGCTG GGGAGTACTC CATTGTGTAT 360 

ATGTACCACA TTTCTTTATC CATTCACCCA TTGAGAACAC TTAGTTGATT CCATATCTTT 420 



49 



WO 98/10068 



PCT/US97/15565 



GCTATTGTCA CTAGTGCTGC AATAAACATA CATGTG CAGG CTCCTTCTAA TATACTGATT 4 80 

TATATTTTAT GGAGAGAGAT AGAGTTCTTA GCGAGTGTGC TGTTTATTTC TAGTGTACTT 540 

GCAACTAATA TTeTGTATAC TCCCTTTAGG TGATTGGAGA TTTAACTTAG ATCTCCAGCA 600 

AGTGCTACAA GAAGAAAAGA TCCTGAAGAA TCAATCAAGT TTCCGTGAAG TCAAGTCCAA 660 

GTAACATCCC CGCCTTAACC ACAAGCAGGA GAA ATG AAG CAC ATT ATC AAC TCG 714 

Met LyB His lie lie Asn Ser 
1 5 

TAT GAA AAC ATC AAC AAC ACA GCA AGA AAT AAT TCC GAC TGT CCT CGT 762 
Tyr Glu Asn lie Asn Asn Thr Ala Arg Asn Asn Ser Asp Cys Pro Arg 
10 15 20 

TGT GTT TTG CCG GAG GAG ATA TTT TTC ACA ATT TCC ATT GTT GGA GTT 810 
Cys Val Leu Pro Glu Glu lie Phe Phe Thr lie Ser lie Val Gly Val 
25 30 35 

TTG GAG AAT CTG ATC GTC CTG CTG GCT GTG TTC AAG AAT AAG AAT CTC 858 
Leu Glu Asn Leu lie Val Leu Leu Ala Val Phe Lys Asn Lys Asn Leu 
40 45 50 55 

CAG GCA CCC ATG TAG TTT TTC ATC TGT AGC TTG GCC ATA TCT GAT ATG 906 
Gin Ala Pro Met Tyr Phe Phe He Cys Ser Leu Ala He Ser Asp Met 
60 65 70 

CTG GGC AGC CTA TAT AAG ATC TTG GAA AAT ATC CTG ATC ATA TTG AGA 954 
Leu Gly Ser Leu Tyr Lys He Leu Glu Asn He Leu He He Leu Arg 
75 80 85 

AAC ATG GGC ATA CTC AAG CCA CGT GGC AGT TTT GAA ACC ACA GCC CAT 1002 
Asn Met Gly lie Leu Lys Pro Arg Gly Ser Phe Glu Thr Thr Ala His 
90 95 100 

GAC ATC ATC GAC TCC CTG TTT CTG CTC TCC CGT CTT GGC TCC ATC TTC 1050 
Asp He He Asp Ser Leu Phe Leu Leu Ser Arg Leu Gly Ser He Phe 
105 HO 115 

GAC CTG CTC GTG ATT GCT GCG GAC CGC TAC ATC ACC ATC TTC CAC GCA 1098 
Asp Leu Leu Val He Ala Ala Asp Arg Tyr He Thr He Phe His Ala 
120 125 130 135 

CTG CGG TAC CAC AGC ATC GTG ACC ATG CGC CGC ACT GTG GTG GTG CTT 1146 
Leu Arg Tyr His Ser He Val Thr Met Arg Arg Thr Val Val Val Leu 
140 145 150 

ACG GTC ATC TGG ACG TTC TGC ACG GGG ACT GGC ATC ACC ATG GTG ATC 1194 
Thr Val He Trp Thr Phe Cys Thr Gly Thr Gly He Thr Met Val He 
155 160 165 
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TTC TCC CAT CAT GTG CCC CAC GTG ATC ACC TTC ACG TCG CTG TTC CCG 1242 
Phe Ser His His Val Pro His Val lie Thr Phe Thr Ser Leu Phe Pro 
170 175 1B0 

CTG ATG CTG GTC TTC ATC CTG TGC CTC TAT GTG CAC ATG TTC CTG CTG 12 90 

Leu Met Leu' Val Phe He Leu Cys Leu Tyr Val His Met Phe Leu Leu 
185 190 195 

GCT CGA TGG CAC ACC AGG AAG ATC TCC ACC CTC CCC AGA GCC AAC ATG 1338 
Ala Arg Trp His Thr Arg Lys lie Ser Thr Leu Pro Arg Ala Asn Met 
200 205 210 215 

AAA GGG GCC ATG ACA CTG ACC ATC CTG CTC GGG GTC TTC ATC TTC TGC 13 86 

Lys Gly Ala Met Thr Leu Thr He Leu Leu Gly Val Phe He Phe Cys 
220 225 230 

TGG GCC CCC TTT GTG CTT CAT GTC CTC TTG ATG ACA TTC TGC CCA AGT 1434 
Trp Ala Pro Phe Val Leu His Val Leu Leu Met Thr Phe Cys Pro Ser 
235 240 245 

AAC CCC TAG TGC GCC TGC TAC ATG TCT CTC TTC CAG GTG AAC GGC ATG 1482 
Asn Pro Tyr Cys Ala Cys Tyr Met Ser Leu Phe Gin Val Asn Gly Met 
250 255 260 

TTG ATC ATG TGC AAT GCC GTC ATT GAC CCC TTC ATA TAT GCC TTC CGG 1530 
Leu He Met Cys Asn Ala Val He Asp Pro Phe He Tyr Ala Phe Arg 
265 270 275 

AGC CCA GAG CTC AGG GAC GCA TTC AAA AAG ATG ATC TTC TGC AGC AGG 1578 
Ser Pro Glu Leu Arg Asp Ala Phe Lys Lys Met He Phe Cys Ser Arg 
280 285 290 295 

TAC TGG TAG AATGGCTGAT CCCTGGTTTT AGAATCCATG GGAATAACGT 1627 
Tyr Trp * 



TGCCAAGTGC 
TTCCCTAATG 
CAGTCTATTG 
GGATAGAATA 
CATTACTTTT 
CAAAGACTTG 
AGGCCATTGC 



CAGAATAGTG 
GATGCAAGGA 
TAGGGGCAAC 
CAAAGTATTA 
GCACCAATCT 
CTTTCCTGTG 
AGCCAATTTC 



TAACATTCCA 
TGACCCACCA 
TCTATTTGTG 
GGTACAAAAG 
AGTAAAACAG 
GACATTAACA 
AGAGT 



ACAAATGCCA 
GCTAGTGTTT 
ACTGGACAGA 
TAATTAGGTT 
CAATAAAAAT 
AGCCAGTTCT 



GTGCTCCTCA 
CTGAATACTA 
TAAAACGTGT 
TGCATTACTT 
TCAAGGGCTT 
GAGGCGGCCT 



CTGGCCTTCC 
TGGCCAGGAA 
AGTAAAAGAA 
ATGACAAATG 
TGGGCTAAGG 
TTCCAGGTGG 



1687 
1747 
1807 
1867 
1927 
1987 
2012 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 97 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys His lie lie Asn Ser Tyr Glu Asn lie ABn Asn Thr Ala Arg 
1 5 10 15 

Asn Asn Ser Asp Cys Pro Arg Cys Val Leu Pro Glu Glu lie Phe Phe 
20 25 30 

Thr He Ser He Val Gly Val Leu Glu Asn Leu He Val Leu Leu Ala 
35 40 45 

Val Phe Lys Asn Lys Asn Leu Gin Ala Pro Met Tyr Phe Phe He Cys 
50 55 60 

Ser Leu Ala He Ser Asp Met Leu Gly Ser Leu Tyr Lys He Leu Glu 
65 70 75 80 

Asn He Leu He He Leu Arg Asn Met Gly He Leu Lys Pro Arg Gly 
85 90 95 

Ser Phe Glu Thr Thr Ala His Asp He He Asp Ser Leu Phe Leu Leu 
100 105 110 

Ser Arg Leu Gly Ser He Phe Asp Leu Leu Val He Ala Ala Asp Arg 
115 120 125 

Tyr He Thr He Phe His Ala Leu Arg Tyr His Ser He Val Thr Met 
130 135 140 

Arg Arg Thr Val Val Val Leu Thr Val He Trp Thr Phe Cys Thr Gly 
145 150 155 160 

Thr Gly He Thr Met Val He Phe Ser His His Val Pro His Val He 
165 170 175 

Thr Phe Thr Ser Leu Phe Pro Leu Met Leu Val Phe He Leu Cys Leu 
180 185 190 

Tyr Val His Met Phe Leu Leu Ala Arg Trp His Thr Arg Lys He Ser 
195 200 205 

Thr Leu Pro Arg Ala Asn Met Lys Gly Ala Met Thr Leu Thr He Leu 
210 215 220 
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Leu Gly Val Phe lie Phe Cys Trp 
225 230 

Leu Met Thr Phe Cys Pro Ser Asn 
245 

Leu Phe Gin Val Asn Gly Met Leu 
260 

Pro Phe lie Tyr Ala Phe Arg Ser 
275 280 

Lys Met lie Phe Cys Ser Arg Tyr 
290 295 



Ala Pro Phe Val Leu His Val Leu 
235 240 

Pro Tyr Cys Ala Cys Tyr Met Ser 
250 255 

lie Met Cys Asn Ala Val lie Asp 
265 270 

Pro Glu Leu Arg Asp Ala Phe Lys 
285 

Trp * 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(ix) FEATURE: 

(A) NAME/ KEY: 5 ' UTR 

(B) LOCATION: 1. .132 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 133.. 1026 

(ix) FEATURE: 

(A) NAME /KEY : 3 ! UTR 

(B) LOCATION: 1027.. 1106 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGGGCCAGAA AGTTCCTGCT TCAGAGCAGA AGATCTTCAG CAAGAACTAC AAAGAAGAAA 60 

AGATTCTGGA GAATCAATCA AGTTTCCTGT CAAGTTCCAG TAACGTTTCT GTCTTAACTG 120 

CACACAGGAA AG ATG AAA CAC ATT CTC AAT CTG TAT GAA AAC CTC AAC 168 
Met Lys His lie Leu Asn Leu Tyr Glu Asn Leu ABn 
1 5 10 
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AGT ACA GCA AGA AAT AAC TCA GAC TGT CCT GCT GTG ATT TTG CCA GAA 216 
Ser Thr Ala Arg Asn Asn Ser Asp Cys Pro Ala Val lie Leu Pro Glu 
15 20 25 

GAG ATA TTT TTC ACA GTA TCC ATT GTT GGG GTT TTG GAG AAC CTG ATG 264 
Glu He Phe Phe Thr Val Ser He Val Gly Val Leu Glu Asn Leu Met 
30 35 40 

GTC CTT CTG GCT GTG GCC AAG AAT AAG ATG CTT CAG" TCG CCC ATG TAC 312 
Val Leu Leu Ala Val Ala Lye Asn Lys Met Leu Gin Ser Pro Met Tyr 
45 50 55 60 

TTT TTC ATC TGC AGC TTG GCT ATT TCC GAT ATG CTG GGG AGC ATG TAC 360 
Phe Phe He Cys Ser Leu Ala He Ser Asp Met Leu Gly Ser Met Tyr 
65 70 75 

AAG ATT TTG GAA AAC GTT CTG ATC ATG TTC AAA AAC ATG GGT TAC CTC 4 08 

Lys He Leu Glu Asn Val Leu He Met Phe Lys Asn Met Gly Tyr Leu 
80 85 90 

GAG CCT CGA GGC AGT TTT GAA AGC ACA GCA GAT GAT GTG GTG GAC TCC 456 
Glu Pro Arg Gly Ser Phe Glu Ser Thr Ala Asp Asp Val Val Asp Ser 
95 100 105 

CTG TTC ATC CTC TCC CTT CTC GGC TCC ATC TGC AGC CTG TCT GTG ATT 504 
Leu Phe He Leu Ser Leu Leu Gly Ser He Cys Ser Leu Ser Val He 
110 115 120 

GCC GCT GAC CGC TAC ACT ACA ATC TTC CAC GCT CTG CAG TAC CAC CGC 552 
Ala Ala Asp Arg Tyr Thr Thr He Phe His Ala Leu Gin Tyr His Arg 
125 130 135 140 

ATC ATG ACC CCC GCA CCG TGC CCT CGT CAT CTG ACG GTC CTC TGG CGA 600 
He Met Thr Pro Ala Pro Cys Pro Arg His Leu Thr Val Leu Trp Arg 
145 . 150 155 



GGC TGC ACA GGC AGT GGC ATT ACC ATC GTG ACC TTC TCC CAT CAC GTC 648 
Gly Cys Thr Gly Ser Gly He Thr lie Val Thr Phe Ser His His Val 
160 165 170 

CCC ACA GTG ATC GCC TTC ACA GCG CTG TTC CCG CTG ATG CTG GCC TTC 696 
Pro Thr Val He Ala Phe Thr Ala Leu Phe Pro Leu Met Leu Ala Phe 
175 180 185 

ATC CTG TGC CTC TAC GTG CAC ATG TTC CTG CTG GCC CGC TCC CAC ACC 744 
He Leu Cys Leu Tyr Val His Met Phe Leu Leu Ala Arg Ser His Thr 
190 195 200 

AGG AGG ACC CCC TCC CTT CCC AAA GCC AAC ATG AGA GGG GCC GTC ACA 792 
Arg Arg Thr Pro Ser Leu Pro Lys Ala Asn Met Arg Gly Ala Val Thr 
205 210 215 220 
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CTG ACT GTC CTG CTC GGG GTC TTC ATT TTC TGT TGG GCA CCC TTT GTC 840 
Leu Thr Val Leu Leu Gly Val Phe lie Phe Cys Trp Ala Pro Phe Val 
225 230 235 

CTT CAT GTC CTC TTG ATG ACA TTC TGC CCA GCT GAC CCC TAC TGT GCC 888 
Leu His Val "Leu Leu Met Thr Phe Cys Pro Ala Asp Pro Tyr Cys Ala 
240 245 250 

TGC TAC ATG TCC CTC TTC CAG GTG AAT GGT GTG TTG ATC ATG TGT AAT 936 
Cys Tyr Met Ser Leu Phe Gin Val Asn Gly Val Leu lie Met Cys Asn 
255 260 265 

GCC ATC ATC GAC CCC TTC ATA TAT GCC TTT CGG AGC CCA GAG CTC AGG 984 
Ala lie He Asp Pro Phe He Tyr Ala Phe Arg Ser Pro Glu Leu Arg 
270 275 280 

GTC GCA TTC AAA AAG ATG GTT ATC TGC AAC TGT TAC CAG TAG 1026 
Val Ala Phe Lys Lys Met Val He Cys Asn Cys Tyr Gin * 
285 290 295 

AATGATTGGT CCCTGATTTT AGGAGCCACA GGGATATACT GTCAGGGACA GAGTAGCGTG 1086 
ACAGACCAAC AACACTAGGA CT HOB 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 

Met Lys His He Leu Asn Leu Tyr Glu Asn Leu Asn Ser Thr Ala Arg 
15 10 15 

Asn Asn Ser Asp Cys Pro Ala Val He Leu Pro Glu Glu He Phe Phe 
20 25 30 

Thr Val Ser He Val Gly Val Leu Glu Asn Leu Met Val Leu Leu Ala 
35 40 45 

Val Ala Lys Asn Lys Met Leu Gin Ser Pro Met Tyr Phe Phe He Cys 
50 55 60 

Ser Leu Ala He Ser Asp Met Leu Gly Ser Met Tyr Lys He Leu Glu 
65 70 75 80 
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Asn Val Leu He Met Phe Lys Asn Met Gly Tyr Leu Glu Pro Arg Gly 
85 90 95 

Ser Phe Glu Ser Thr Ala Asp Asp Val Val Asp Ser Leu Phe He Leu 
100 105 , 110 

Ser Leu Leu Gly Ser He Cys Ser Leu Ser Val He Ala Ala Asp Arg 
115 120 125 

Tyr Thr Thr He Phe His Ala Leu Gin Tyr His Arg He Met Thr Pro 
130 135 140 

Ala Pro Cys Pro Arg His Leu Thr Val Leu Trp Arg Gly Cys Thr Gly 
145 150 155 160 

Ser Gly He Thr He Val Thr Phe Ser His His Val Pro Thr Val He 
165 170 175 

Ala Phe Thr Ala Leu Phe Pro Leu Met Leu Ala Phe He Leu Cys Leu 
180 185 190 

Tyr Val His Met Phe Leu Leu Ala Arg Ser His Thr Arg Arg Thr Pro 
195 200 205 

Ser Leu Pro Lys Ala Asn Met Arg Gly Ala Val Thr Leu Thr Val Leu 
210 215 220 

Leu Gly Val Phe He Phe Cys Trp Ala Pro Phe Val Leu His Val Leu 
225 230 235 240 

Leu Met Thr Phe Cys Pro Ala Asp Pro Tyr Cys Ala Cys Tyr Met Ser 
245 250 255 

Leu Phe Gin Val Asn Gly Val Leu He Met Cys Asn Ala He He Asp 
260 265 270 

Pro Phe He Tyr Ala Phe Arg Ser Pro Glu Leu Arg Val Ala Phe Lys 
275 280 285 

Lys Met Val He Cys Asn Cys Tyr Gin * 
290 295 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 B base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: "CDNA to mRNA 



PCT/US97/15565 



(ix) FEATURE: 

(A) NAME /KEY: 5 'UTR 

(B) LOCATION: 1..297 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 298.. 1269 

(ix) FEATURE: 

(A) NAME/KEY: 3 'UTR 

(B) LOCATION: 1270.. 1338 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCTGTAACT GTAGCAACCG GTGTTGGGTG GGGATGAGAA GAGACCAGAG AGAGAGAGGG 60 

TCAGAGCGAC AGGGGATGAG ACAGGCTGGT CAGAGTCTGC ACTGATTGTT GGAGACGCAA 120 

AGGAAAGTTT TTTCTATGTC TCCAACCTCC CCCTCCTCCC CCGTTTCTCT CTGGAGAAAC 180 

TAAAATGTAG ACTGGACAGC ATCCACAAGA GAAGCACCTA GAAGAAGATT TTTTTTTCCC 240 

AGCAGCTTGC TCAGGACCCT GCAGGAGCTG CAGCCGGAAC TGGTCCCGCC GATAACC 297 

ATG AAC TCT TCC TGC TGC CCG TCC TCC TCT TAT CCG ACG CTG CCT AAC 34 5 

Met Asn Ser Ser Cys Cys Pro Ser Ser Ser Tyr Pro Thr Leu Pro Asn 
1 5 .10 15 

CTC TCC CAG CAC CCT GCA GCC CCC TCT GCC AGC AAC CGG AGT GGC AGT 393 
Leu Ser Gin His Pro Ala Ala Pro Ser Ala Ser Asn Arg Ser Gly Ser 
20 25 30 

GGG TTC TGC GAG CAG GTT TTC ATC AAG CCA GAG GTC TTC CTG GCA CTG 441 
Gly Phe Cys Glu Gin Val Phe lie Lys Pro Glu Val Phe Leu Ala Leu 
35 40 45 

GGC ATC GTC AGT CTG ATG GAA AAC ATC CTG GTG ATC CTG GCT GTG GTG 4 89 

Gly He Val Ser Leu Met Glu Asn He Leu Val He Leu Ala Val Val 
50 55 60 

AGG AAC GGC AAC CTG CAC TCC CCC ATG TAC TTC TTC CTG CTG AGC CTG 537 
Arg Asn Gly Asn Leu His Ser Pro Met Tyr Phe Phe Leu Leu Ser Leu 
65 70 75 80 

CTG CAG GCC GAC CTG CTG GTG AGC CTG TCC AAC TCC CTG GAG ACC ATC 58 5 

Leu Gin Ala Asp Leu Leu Val Ser Leu Ser Asn Ser Leu Glu Thr He 
85 90 95 
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ATG ATC GTG GTT ATC AAC AGC GAC TCC CTG ACC TTG GAG GAC CAA TTC 633 
Met lie Val Val lie Asn Ser Asp Ser Leu Thr Leu Glu Asp Gin Phe 
100 105 110 

ATC CAG CAC ATG GAC AAC ATC TTC GAC TCT ATG ATC TGC ATC TCC CTG 681 
lie Gin His Met Asp Asn lie Phe Asp Ser Met lie Cys lie Ser Leu 
115 120 125 

GTG GCC TCC ATC TGC AAC CTC CTG GCC ATC GCC GTG~ GAC AGG TAC GTC 72 9 

Val Ala Ser lie Cys Asn Leu Leu Ala lie Ala Val Asp Arg Tyr Val 
130 135 140 

ACC ATC TTC TAT GCC CTC CGT TAC CAC AGC ATC ATG ACG GTT AGG AAA 777 
Thr lie Phe Tyr Ala Leu Arg Tyr His Ser lie Met Thr Val Arg Lys 
145 150 155 160 

GCC CTC TCC TTG ATC GTG GCC ATC TGG GTC TGC TGT GGC ATC TGC GGC B25 
Ala Leu Ser Leu lie Val Ala lie Tip Val Cys Cys Gly lie Cys Gly 
165 170 175 

GTG ATG TTC ATC GTC TAC TCC GAG AGC AAG ATG GTC ATC GTG TGC CTC 873 
Val Met Phe lie Val Tyr Ser Glu Ser Lys Met Val lie Val Cys Leu 
180 185 190 

ATC ACC ATG TTC TTC GCC ATG GTG CTC CTC ATG GGC ACC CTG TAC ATC 921 
He Thr Met Phe Phe Ala Met Val Leu Leu Met Gly Thr Leu Tyr He 
195 200 205 

CAC ATG TTC CTC TTC GCC AGG, CTG CAC GTC CAG CGC ATC GCG GCA CTG 969 
His Met Phe Leu Phe Ala Arg Leu His Val Gin Arg He Ala Ala Leu 
210 215 220 

CCA CCT GCT GAC GGG CTA GCC CCG CAG CAG CAC TCG TGC ATG AAG GGG 1017 
Pro Pro Ala Asp Gly Leu Ala Pro Gin Gin His Ser Cys Met Lys Gly 
225 230 235 240 

GCC GTC ACC ATC ACC ATC CTG CTG GGG GTT TTC ATC TTC TGC TGG GCG 1065 
Ala Val Thr He Thr lie Leu Leu Gly Val Phe He Phe Cys Trp Ala 
245 250 255 

CCT TTC TTC CTC CAC CTG GTC CTC ATC ATC ACC TGC CCC ACC AAC CCC 1113 
Pro Phe Phe Leu His Leu Val Leu He lie Thr Cys Pro Thr Asn Pro 
260 265 270 

TAC TGC ATC TGC TAC ACG GCG CAC TTC AAC ACC TAC CTG GTT CTC ATC 1161 
Tyr Cys lie Cys Tyr Thr Ala His Phe Asn Thr Tyr Leu Val Leu He 
275 280 285 

ATG TGC AAC TCT GTC ATC GAC CCC CTC ATC TAC GCC TTC CGC AGC CTG 1209 
Met Cys Asn Ser Val lie Asp Pro Leu lie Tyr Ala Phe Arg Ser Leu 
290 295 300 
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GAG CTG CGA AAC ACC TTC AAG GAG ATT CTC TGC GGT TGC AAT GGC ATG 1257 
Glu Leu Arg Asn Thr Phe Lys Glu lie Leu Cys Gly Cys Asn Gly Met 
305 310 315 320 

AAC GTG GGC TAG GAACCCCCGA GGAGGTGTTC CACGGCTAGC CAAGAGAGAA 13 09 

Asn Val Gly* * 



AAGCAATGCT CAGGTGAGAC ACAGAAGGG 1338 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Met Asn Ser Ser Cys Cys Pro Ser Ser Ser Tyr Pro Thr Leu Pro Asn 
15 10 15 

Leu Ser Gin His Pro Ala Ala Pro Ser Ala Ser Asn Arg Ser Gly Ser 
20 25 30 

Gly Phe Cys Glu Gin Val Phe lie Lys Pro Glu Val Phe Leu Ala Leu 
35 40 45 

Gly He Val Ser Leu Met Glu Asn He Leu Val He Leu Ala Val Val 
50 55 60 

Arg Asn Gly Asn Leu His Ser Pro Met Tyr Phe Phe Leu Leu Ser Leu 
65 70 75 80 

Leu Gin Ala Asp Leu Leu Val Ser Leu Ser Asn Ser Leu Glu Thr He 
85 90 95 

Met He Val Val He Asn Ser Asp Ser Leu Thr Leu Glu Asp Gin Phe 
100 105 110 

He Gin His Met Asp Asn He Phe Asp Ser Met He Cys He Ser Leu 
115 120 125 

Val Ala Ser He Cys Asn Leu Leu Ala He Ala Val Asp Arg Tyr Val 
130 135 140 

Thr He Phe Tyr Ala Leu Arg Tyr His Ser He Met Thr Val Arg Lys 
145 150 155 160 
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Ala Leu Ser Leu He 
165 

Val Met Phe He Val 

iao 

He Thr Met Phe Phe 
195 

His Met Phe Leu Phe 
210 

Pro Pro Ala Asp Gly 
225 

Ala Val Thr He Thr 
245 

Pro Phe Phe Leu His 
260 

Tyr Cys He Cys Tyr 
275 

Met Cys Asn Ser Val 
2 90 

Glu Leu Arg Asn Thr 
305 

Asn Val Gly * 



Val Ala He Trp Val Cys 
170 

Tyr Ser Glu Ser Lys Met 
185 

Ala Met Val Leu Leu Met 
200 

Ala Arg Leu His Val Gin 
215 

Leu Ala Pro Gin Gin His 
230 235 

He Leu Leu Gly Val Phe 
250 

Leu Val Leu He He Thr 
265 

Thr Ala His Phe Asn Thr 
280 

He Asp Pro Leu He Tyr 
295 

Phe Lys Glu He Leu Cys 
310 315 



Cys Gly He Cys Gly 
175 

Val He Val Cys Leu 
190 

Gly Thr Leu Tyr He 
205 

Arg He Ala Ala Leu 
220 

Ser Cys Met Lys Gly 
240 

He Phe Cys Trp Ala 
255 

Cys Pro Thr Asn Pro 

270 

Tyr Leu Val Leu He 
285 

Ala Phe Arg Ser Leu 
300 

Gly Cys Asn Gly Met 
320 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: micsJE eature 

(B) LOCATION: 1..30 

(D) OTHER INFORMATION: /function ■ "Degenerate 
oligonucleotide primer (sense*)" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 
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GAGTCGACCR CCCATGTAYT DYTTCATCTG 



PCTAJS97/1S565 



30 



{2) INFORMATION FOR SEQ ID NO; 15; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 

( ix ) FEATURE : 

(A) NAME/ KEY : 5 'UTR 

(B) LOCATION: 1..393 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 394.. 1389 

(ix) FEATURE: 

(A) NAME /KEY : 3 'UTR 

(B) LOCATION: 1390.. 1671 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AGCTTCCGAG AGGCAGCCGA TGTGAGCATG TGCGCACAGA TTCGTCTCCC AATGGCATGG 60 

CAGCTTCAAG GAAAATTATT TTGAACAGAC TTGAATGCAT AAGATTAAAG TTAAAGCAGA 120 

AGTGAGAACA AGAAAGCAAA GAGCAGACTC TTTCAACTGA GAATGAATAT TTTGAAGCCC 180 

AAGATTTTAA CGTGATGATG ATTAGAGTCG TACCTAAAAG AGACTAAAAA CTCCATGTCA 240 

AGCTCTGGAC TTGTGACATT TACTCACAGC AGGCATGGCA ATTTTAGCCT CACAACTTTC 300 

AGACAGATAA AGACTTGGAG GAAATAACTG AGACGACTCC CTGACCCAGG AGGTTAAATC 360 

AATTCAGGGG GACACTGGAA TTCTCCTGCC AGC ATG GTG AAC TCC ACC CAC CGT 414 



Met Val Asn Ser Thr His Arg 
1 5 



GGG ATG CAC ACT TCT CTG CAC CTC TGG 
Gly Met His Thr Ser Leu His Leu Trp 
10 15 



AAC CGC 
Asn Arg 



AGC AGT TAC AGA CTG 
Ser Ser Tyr Arg Leu 
20 



462 



CAC AGC AAT GCC AGT GAG TCC CTT GGA 
His Ser Asn Ala Ser Glu Ser Leu Gly 
25 30 



AAA GGC 
Lys Gly 



TAC TCT GAT GGA GGG 
iyr Ser Asp Gly Gly 
35 



510 
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TGC TAC GCG CAA CTT TTT GTC TCT CCT GAG GTG TTT GTG ACT CTG GGT 558 
Cys Tyr Ala Gin Leu Phe Val Ser Pro Glu Val Phe Val Thr Leu Gly 
40 45 50 55 

GTG ATC AGC TTG TTG GAG AAT ATC TTA GAG ATT GTG GCA ATA GCC AAG 606 
Val lie Ser Leu Leu Glu Asn He Leu Glu He Val Ala He Ala Lys 
60 65 70 

AAC AAG AAT CTG CAT TCA CCC ATG TAC TTT TTC ATC TGC AGC TTG GCT 654 
Asn Lys Asn Leu His Ser Pro Met Tyr Phe Phe He Cys Ser Leu Ala 
75 80 85 

GTG GCT GAT ATG CTG GTG AGC GTT TCA AAT GGA TCA GAA ACC ATT ATC 702 
Val Ala Asp Met Leu Val Ser Val Ser Asn Gly Ser Glu Thr He He 
90 95 100 

ATC ACC CTA TTA AAC CGT ACA GAT ACG GAT GCA CAG AGT TTC ACA GTG 750 
He Thr Leu Leu Asn Arg Thr Asp Thr Asp Ala Gin Ser Phe Thr Val 
105 110 115 

AAT ATT GAT AAT GTC ATT GAC TCG GTG ATC TGT AGC TCC TTG CTT GCA 798 
Asn He Asp Asn Val He Asp Ser Val He Cys Ser Ser Leu Leu Ala 
120 125 130 135 

TCC ATT TGC AGC CTG CTT TCA ATT GCA GTG GAC AGG TAC TTT ACT ATC 846 
Ser lie Cys Ser Leu Leu Ser He Ala Val Asp Arg Tyr Phe Thr He 
140 145 150 

TTC TAT GCT CTC CAG TAC CAT AAC ATT ATG ACA GTT AAG CGG GTT GGG 894 
Phe Tyr Ala Leu Gin Tyr His Asn He Met Thr Val Lys Arg Val Gly 
155 160 165 

ATC AGC ATA AGT TGT ATC TGG GCA GCT TGC ACG GTT TCA GGT ATT TTG 942 
He Ser He Ser Cys He Trp Ala Ala Cys Thr Val Ser Gly He Leu 
170 175 180 

TTC ATC ATT TAC TCA GAT AGT AGT GCT GTC ATC ATC TGC CTC ATC ACC 990 
Phe He He Tyr Ser Asp Ser Ser Ala Val He He Cys Leu lie Thr 
185 190 195 

ATG TTC TTC ACC ATG CTG GCT CTC ATG GCT TCT CTC TAT GTC CAC CTG 1038 
Met Phe Phe Thr Met Leu Ala Leu Met Ala Ser Leu Tyr Val His Leu 
200 205 210 215 

TTC CTG ATG GCC AGG CTT CAC ATT AAG AGG ATT GCT GTC CTC CCC GGC 1086 
Phe Leu Met Ala Arg Leu His lie Lys Arg lie Ala Val Leu Pro Gly 
220 225 230 

ACT GGT GCC ATC CGC CAA GGT GCC AAT ATG AAG GGA GCG ATT ACC TTG 1134 
Thr Gly Ala lie Arg Gin Gly Ala Asn Met Lys Gly Ala lie Thr Leu 
235 240 245 
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ACC ATC CTG ATT GGC GTC TTT GTT GTC TGC TGG GCC CCA TTC TTC CTC 1182 
Thr He Leu He Gly Val Phe Val Val Cys Trp Ala Pro Phe Phe Leu 
250 255 260 



CAC TTA ATA TTC TAC ATC TCT TGT CCT CAG AAT CCA TAT TGT GTG TGC 1230 
His Leu He "Phe Tyr He Ser Cys Pro Gin Asn Pro Tyr Cys Val Cys 
265 270 275 

TTC ATG TCT CAC TTT AAC TTG TAT CTC ATA CTG ATC ATG TGT AAT TCA 1278 
Phe Met Ser His Phe Asn Leu Tyr Leu He Leu He Met Cys Asn Ser 
280 285 290 295 



ATC ATC GAT CCT CTG ATT TAT GCA CTC CGG AGT CAA GAA CTG AGG AAA 1326 
He He Asp Pro Leu He Tyr Ala Leu Arg Ser Gin Glu Leu Arg Lys 
300 305 310 

ACC TTC AAA GAG ATC ATC TCT TCC TAT CCC CTG GGA GGC CTT TGT GAC 1374 
Thr Phe Lys Glu He He Ser Ser Tyr Pro Leu Gly Gly Leu Cys Asp 
315 320 325 



TTG TCT AGC AGA TAT TAAATGGGGA CAGAGCACGC AATATAGGAA CATCCATAAG 1429 
Leu Ser Ser Arg Tyr 
330 

AGACTTTTTC ACTCTTACCC TACCTGAATA TTCTACTTCT GCAACAGCTT TCTCTTCCGT 1489 

GTAGGGTACT GGTTGAGATA TCCATTGTGT AAATTTAAGC CTATGATTTT TAATGAGAAA 154 9 

AAATGCCCAG TCTCTGTATT ATTTCCAATC TCATGCTACT TTTTTGGCCA TAAAATATGA 1609 



ATCTATGTTA TAGGTTGTAG GCACTGTGGA TTTACAAAAA GAAAAGTCCT TATTAAAAGA 1669 
TT 1671 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Val Asn Ser Thr His Arg Gly Met His Thr Ser Leu His Leu Trp 
15 10 15 



Asn Arg Ser Ser Tyr Arg Leu His Ser Asn Ala $er Glu Ser Leu Gly 
20 25 30 
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Lys Gly Tyr Ser Asp Gly Gly Cys Tyr Ala Gin Leu Phe Val .Ser Pro 
35 40 45 

Glu Val Phe Val Thr Leu Gly Val He Ser Leu Leu Glu Asn He Leu 
50 • 55 60 

Glu He Val Ala He Ala Lys Asn Lys Asn Leu His Ser Pro Met Tyr 
65 70 75 80 

Phe Phe He Cys Ser Leu Ala Val Ala Asp Met Leu Val Ser Val Ser 
85 90 95 

Asn Gly Ser Glu Thr He He He Thr Leu Leu Asn Arg Thr Asp Thr 
100 105 110 

Asp Ala Gin Ser Phe Thr Val Asn He Asp Asn Val He Asp Ser Val 
115 120 125 

He Cys Ser Ser Leu Leu Ala Ser He Cys Ser Leu Leu Ser He Ala 
130 135 140 

Val Asp Arg Tyr Phe Thr He Phe Tyr Ala Leu Gin Tyr His Asn He 
145 150 155 160 

Met Thr Val Lys Arg Val Gly He Ser He Ser Cys He Trp Ala Ala 
165 170 175 

Cys Thr Val Ser Gly He Leu Phe He He Tyr Ser Asp Ser Ser Ala 
180 185 190 

Val He He Cys Leu He Thr Met Phe Phe Thr Met Leu Ala Leu Met 
195 200 205 

Ala Ser Leu Tyr Val His Leu Phe Leu Met Ala Arg Leu His He Lys 
210 215 220 

Arg He Ala Val Leu Pro Gly Thr Gly Ala He Arg Gin Gly Ala Asn 
225 230 235 240 

Met Lys Gly Ala He Thr Leu Thr He Leu lie Gly Val Phe Val Val 
245 250 255 

Cys Trp Ala Pro Phe Phe Leu His Leu He Phe Tyr He Ser Cys Pro 
260 265 270 

Gin Asn Pro Tyr Cys Val Cys Phe Met Ser His Phe Asn Leu Tyr Leu 
275 280 285 



He Leu He Met Cys Asn Ser He He Asp Pro Leu He Tyr Ala Leu 
290 295 300 
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Arg Ser Gin Glu Leu Arg Lys Thr Phe Lys Glu lie lie Ser Ser Tyr 
305 310 315 320 

Pro Leu Gly Gly Leu Cys Asp Leu Ser Ser Arg Tyr 
325 330 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..975 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG AAC TCC TCC TCC ACC CTG ACT GTA TTG AAT CTT ACC CTG AAC GCC 48 
Met Asn Ser Ser Ser Thr Leu Thr Val Leu Asn Leu Thr Leu Asn Ala 
15 10 15 

TCA GAG GAT GGC ATT TTA GGA TCA AAT GTC AAG AAC AAG TCT TTG GCC 96 
Ser Glu Asp Gly lie Leu Gly Ser Asn Val Lys Asn Lys Ser Leu Ala 
20 25 30 

TGT GAA GAA ATG GGC ATT GCC GTG GAG GTG TTC CTG ACC CTG GGT CTC 144 
Cys Glu Glu Met Gly lie Ala Val Glu Val Phe Leu Thr Leu Gly Leu 
35 40 45 

GTC AGC CTC TTA GAG AAC ATC CTG GTC ATT GGG GCC ATA GTA AAG AAC 192 
Val Ser Leu Leu Glu Asn He Leu Val He Gly Ala He Val Lys Asn 
50 55 60 

AAA AAC CTG CAC TCA CCC ATG TAC TTC TTT GTG GGC AGC TTA GCC GTG 24 0 

Lys Asn Leu His Ser Pro Met Tyr Phe Phe Val Gly Ser Leu Ala Val 
65 70 75 80 

GCC GAC ATG CTG GTG AGC ATG TCC AAT GCC TGG GAG ACT GTC ACC ATA 2 88 

Ala Asp Met Leu Val Ser Met Ser Asn Ala Trp Glu Thr Val Thr He 
85 90 95 

TAC TTG CTA AAT AAT AAA CAC CTG GTG ATA GCC GAC ACC TTT GTG CGA 336 
Tyr Leu Leu Asn Asn Lys His Leu Val He Ala Asp Thr Phe Val Arg 
100 105 110 
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CAC ATC GAC AAC GTG TTC GAC TCC ATG ATC TGC ATC TCT GTG GTG GCC 3 64 

His He Asp Asn Val Phe Asp Ser Met He Cys He Ser Val Val Ala 
115 120 125 

TCG ATG TGC AGT TTG CTG GCC ATT GCG GTG GAT AGG TAC ATC ACC ATC 432 
Ser Met Cys Ser Leu Leu Ala He Ala Val Asp Arg Tyr He Thr He 
130 135 140 

TTC TAT GCC TTG CGC TAC CAC CAC ATC ATG ACC GCG AGG CGC TCG GGG 480 
Phe Tyr Ala Leu Arg Tyr His His He Met Thr Ala Arg Arg Ser Gly 
145 150 155 160 

GTG ATC ATC GCC TGC ATT TGG ACC TTC TGC ATA AGC TGC GGC ATT GTT 528 
Val He He Ala Cys He Trp Thr Phe Cys He Ser Cys Gly He Val 
165 170 175 

TTC ATC ATC TAC TAT GAG TCC AAG TAT GTG ATC ATT TGC CTC ATC TCC 576 
Phe He He Tyr Tyr Glu Ser Lys Tyr Val He He Cys Leu He Ser 
180 185 190 

ATG TTC TTC ACC ATG CTG TTC TTC ATG GTG TCT CTG TAT ATA CAC ATG 624 
Met Phe Phe Thr Met Leu Phe Phe Met Val Ser Leu Tyr He His Met 
195 200 205 

TTC CTC CTG GCC CGG AAC CAT GTC AAG CGG ATA GCA GCT TCC CCC AGA 672 
Phe Leu Leu Ala Arg Asn His Val Lys Arg He Ala Ala Ser Pro Arg 
210 215 220 

TAC AAC TCC GTG AGG CAA AGG ACC AGC ATG AAG GGG GCT ATT ACC CTC 720 
Tyr Asn Ser Val Arg Gin Arg Thr Ser Met Lys Gly Ala He Thr Leu 
225 230 235 240 

ACC ATG CTA CTG GGG ATT TTC ATT GTC TGC TGG TCT CCC TTC TTT CTT 768 
Thr Met Leu Leu Gly He Phe He Val Cys Trp Ser Pro Phe Phe Leu 
245 250 255 

CAC CTT ATC TTA ATG ATC TCC TGC CCT CAG AAC GTC TAC TGC TCT TGC 816 
His Leu He Leu Met He Ser Cys Pro Gin Asn Val Tyr Cys Ser Cys 
260 265 270 

TTT ATG TCT TAC TTC AAC ATG TAC CTT ATA CTC ATC ATG TGC AAC TCC 864 
Phe Met Ser Tyr Phe Asn Met Tyr Leu He Leu He Met Cys Asn Ser 
275 280 285 

GTG ATC GAT CCT CTC ATC TAC GCC CTC CGC AGC CAA GAG ATG CGG AGG 912 
Val He Asp Pro Leu He Tyr Ala Leu Arg Ser Gin Glu Met Arg Arg 
290 295 300 

ACC TTT AAG GAG ATC GTC TGT TGT CAC GGA TTC CGG CGA CCT TGT AGG 960 
Thr Phe Lys Glu He Val Cys Cys His Gly Phe Arg Arg Pro Cys Arg 
305 310 315 320 
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CTC CTT GGC GGG TAT TAA 
Leu Leu Gly Gly Tyr 
325 



978 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asn Ser Ser Ser Thr Leu Thr Val Leu Asn Leu Thr Leu Asn Ala 
15 10 15 

Ser Glu Asp Gly He Leu Gly Ser Asn Val Lys Asn Lys Ser Leu Ala 
20 25 30 

Cys Glu Glu Met Gly He Ala Val Glu Val Phe Leu Thr Leu Gly Leu 
35 40 45 

Val Ser Leu Leu Glu Asn He Leu Val He Gly Ala He Val Lys Asn 
50 55 60 

Lys Asn Leu His Ser Pro Met Tyr Phe Phe Val Gly Ser Leu Ala Val 
65 70 75 80 

Ala Asp Met Leu Val Ser Met Ser Asn Ala Trp Glu Thr Val Thr He 
85 90 95 

Tyr Leu Leu Asn Asn Lys His Leu Val He Ala Asp Thr Phe Val Arg 
100 105 110 

His He Asp Asn Val Phe Asp Ser Met He Cys He Ser Val Val Ala 
115 120 125 

Ser Met Cys Ser Leu Leu Ala He Ala Val Asp Arg Tyr lie Thr He 
130 135 140 

Phe Tyr Ala Leu Arg Tyr His His He Met Thr Ala Arg Arg Ser Gly 
145 150 155 160 

Val He He Ala Cys He Trp Thr Phe Cys He ?er Cys Gly He Val 
165 170 175 
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Phe lie He Tyr Tyr Glu Ser Lys Tyr Val He He Cys Leu He Ser 
180 185 190 



Met Phe Phe Thr Met Leu Phe Phe Met Val Ser Leu Tyr He His Met 
195 - 200 205 

Phe Leu Leu Ala Arg Asn His Val Lys Arg He Ala Ala Ser Pro Arg 
210 215 220_ 

Tyr Asn Ser Val Arg Gin Arg Thr Ser Met Lys Gly Ala He Thr Leu 

225 230 235 240 



Thr Met Leu Leu Gly He Phe He Val Cys Trp Ser Pro Phe Phe Leu 
245 250 255 



His Leu He Leu Met He Ser Cys 
260 

Phe Met Ser Tyr Phe Asn Met Tyr 
275 280 

Val He Asp Pro Leu He Tyr Ala 
290 295 



Pro Gin Asn Val Tyr Cys Ser Cys 
265 270 

Leu He Leu He Met Cys Asn Ser 
285 

Leu Arg Ser Gin Glu Met Arg Arg 
300 



Thr Phe Lys Glu He Val Cys Cys His Gly Phe Arg Arg Pro Cys Arg 
305 310 315 320 

Leu Leu Gly Gly Tyr 
325 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1. .32 

(D) OTHER INFORMATION: /function = "Degenerate 
oligonucleotide primer (antisense) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GAATTCGACG TCACAGTATG ACGGCCATGG 30 
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WHAT WE CLAIM IS: 

1 . A method for characterizing a compound as an agonist of a mammalian 
melanocoitin receptor, the method comprising the steps of: 

(a) providing a panel comprising a first mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocoitin receptor that is 
the a-MSH receptor, a second mammalian cell comprising a recombinant expression 
construct encoding a mammalian melanocoitin receptor that is the ACTH receptor, a 
third mammalian cell comprising a recombinant expression construct encoding a 
mammalian melanocoitin receptor that is the MC-3 receptor, a fourth mammalian cell 
comprising a recombinant expression construct encoding a mammalian melanocoitin 
receptor that is the MC-4 receptor, and a fifth mammalian cell comprising a recombinant 
expression construct encoding a mammalian melanocoitin receptor that is the MC-5 
receptor, wherein each mammalian cell expresses the melanocortin receptor encoded by 
the recombinant expression construct comprising the cell; 

(b) contacting each of the cells of the panel with a test compound to be 
characterized as an agonist of a mammalian melanocortin receptor; 

(c) detecting binding of the test compound to each of the mammalian 
melanocortin receptors by assaying for a metabolite produced in the cells that bind the 
compound. 

2. The method of claim 1 , wherein the metabolite detected in subpart (c) is 
cyclic AMP. 

3. The method of claim 1, each of the cells further comprising a 
recombinant expression construct encoding a cyclic AMP responsive element (CRE) 
transcription factor binding site operatively linked to a nucleic acid sequence encoding 
a protein capable of producing a detectable metabolite. 

4. The method of claim 3, wherein the nucleic acid sequence encodes P- 
galactosidase. 
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5. The method of claim 3, wherein the recombinant expression construct is 
pCRE/p-galactosidase. 

6. The method of claim 3, wherein the detectable metabolite produced by 
5 the protein encoded by the recombinant expression construct is produced by binding of 

the test compound to the mammalian melanocortin receptor encoded by each of the cells 
of the panel. 

7. A method for characterizing a compound as an antagonist of a 
10 mammalian melanocortin receptor, the method comprising the steps of: 

(a) providing a panel comprising a first mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocortin receptor that is 
the a-MSH receptor, a second mammalian cell comprising a recombinant expression 
construct encoding a mammalian melanocortin receptor that is the ACTH receptor, a 

15 third mammalian cell comprising a recombinant expression construct encoding a 

mammalian melanocortin receptor that is the MC-3 receptor, a fourth mammalian cell 
comprising a recombinant expression construct encoding a mammalian melanocortin 
receptor that is the MC-4 receptor, and a fifth mammalian cell comprising a recombinant 
expression construct encoding a mammalian melanocortin receptor that is the MC-5 

20 receptor, wherein each mammalian cell expresses the melanocortin receptor encoded by 
the recombinant expression construct comprising the cell; 

(b) contacting each of the cells of the panel with an agonist of the 
mammalian melanocortin receptor in an amount sufficient to produce a detectable 
amount of a metabolite produced in the cells that bind the agonist, in the presence or 

25 absence of a test compound to be characterized as an antagonist of a mammalian 

melanocortin receptor; 

(c) detecting the amount of the metabolite produced in each cell in the panel 
in the presence of the test compound with the amount of the metabolite produced in each 
cell in the panel in the absence. 

30 

8. The method of claim 7, wherein the metabolite detected in subpart (c) is 
cyclic AMP. 
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9. The method of claim 7, each of the cells further comprising a 
recombinant expression construct encoding a cyclic AMP responsive element (CRE) 
transcription factor binding site operatively linked to a nucleic acid sequence encoding 
a protein capable of producing a detectable metabolite. 

10. The method of claim 9, wherein the nucleic acid sequence encodes P~ 
galactosidase. 

1 1 . The method of claim 9, wherein the recombinant expression construct is 
pCRE/p-galactosidase. 

12. The method of claim 9, wherein the detectable metabolite produced by 
the protein encoded by the recombinant expression construct is produced by binding of 
the test compound to the mammalian melanocortin receptor encoded by each of the cells 
of the panel. 

13. The method of claim 1 wherein the test compound is an agonist of the 
MC-3 mammalian melanocortin receptor. 

14. The method of claim 1 wherein the test compound is an agonist of the 
MC-4 mammalian melanocortin receptor. 

15. The method of claim 3 wherein the test compound is an agonist of the 
MC-3 mammalian melanocortin receptor. 

16. The method of claim 3 wherein the test compound is an agonist of the 
MC-4 mammalian melanocortin receptor. 

1 7. The method of claim 7 wherein the test compound is an antagonist of the 
MC-3 mammalian melanocortin receptor. 
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1 8. The method of claim 7 wherein the test compound is an antagonist of the 
MC-4 mammalian melanocortin receptor. 

19. The method of claim 9 wherein the test compound is an antagonist of the 
5 MC-3 mammalian melanocortin receptor. 

20. The method of claim 9 wherein the test compound is an antagonist of the 
MC-4 mammalian melanocortin receptor. 

10 2 1 . A mammalian melanocortin MC-3 receptor agonist according to claims 

13 or 15. 

22. A mammalian melanocortin MC-4 receptor agonist according to claims 

14 or 16. 

15 

23. A mammalian melanocortin MC-3 receptor antagonist according to 
claims 17 or 19. 

24. A mammalian melanocortin MC-4 receptor antagonist according to 
20 claims 18 or 20. 

25. A method of inhibiting feeding behavior in an animal, the method 
comprising administering an effective amount of a mammalian melanocortin MC-3 or 
MC-4 receptor agonist according to claim 21 . 

25 

26. A method of stimulating feeding behavior in an animal, the method 
comprising-administering an effective amount of a mammalian melanocortin MC-3 or 
MC-4 receptor antagonist according to claim 24. 

30 27. A method for characterizing a mammalian melanocortin MC-3 or MC-4 

receptor agonist as an inhibitor of feeding behavior in an animal, the method comprising: 
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(a) providing food to an animal that has been deprived of food for at least 1 2 
hours with or without administering to the animal a mammalian melanocortin MC-3 or 
MC-4 receptor agonist according to claim 25; and 

(b) comparing the amount of food eaten by the animal with and without 
administration of the mammalian melanocortin MC-3 or MC-4 receptor agonist, 

28. A method for characterizing a mammalian melanocortin MC-3 or MC-4 
receptor antagonist as a stimulator of feeding behavior in an animal, the method 
comprising: 

(a) providing food to an animal that has not been otherwise deprived of food 
for at least 12 hours, with or without administering to the animal a mammalian 
melanocortin MC-3 or MC-4 receptor antagonist according to claim 26 immediately 
prior to the onset of darkness or nighttime; and 

(b) comparing the amount of food eaten by the animal with and without 
administration of the mammalian melanocortin MC-3 or MC-4 receptor antagonist. 

29. A mammalian melanocortin MC-3 or MC-4 receptor agonist having the 
general formula: 

A-B-C-D-E-F-G-amide 
wherein A is Leu, lie, Nle, Met, or substituted analogues thereof; 

B is Asp, Glu , or substituted analogues thereof; 

C is His or substituted analogues thereof; 

D is D-Phe, D-Tyr or substituted analogues thereof; 

E is Arg, Lys, homoArg, homoLys, or substituted analogues thereof; 

F is Tip or substituted analogues thereof; 

G is Lys, homoLys or substituted analogues thereof; 
and wherein the peptide is cyclized by the formation of an amide bond between the side 
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side 
chain amino group of the Lys or homoLys residue at position G. 
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30. A mammalian melanocortin MC-3 or MC-4 receptor antagonist having 
the general formula: 

A-B-C-D-E-F-G-amide 
wherein A is Leu, He, Nle, Met, or substituted analogues thereof; 

B is Asp, Glu or substituted analogues thereof; 

C is His or substituted analogues thereof; 

D is D-Nal or substituted analogues thereof; 

E is Arg, Lys, homoArg, homoLys or substituted analogues thereof; 

F is Trp or substituted analogues thereof; 

G is Lys, homoLys or substituted analogues thereof; 
and wherein the peptide is cyclized by the formation of an amide bond between the side 
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side 
chain amino group of the Lys or homoLys residue at position G. 

31. A biological screening panel for determining the receptor 
agonist/antagonist profile of a test compound, the panel comprising a first mammalian 
cell comprising a recombinant expression construct encoding a mammalian melanocortin 
receptor that is the cc-MSH receptor, a second mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocortin receptor that is 
the ACTH receptor, a third mammalian cell comprising a recombinant expression 
construct encoding a mammalian melanocortin receptor that is the MC-3 receptor, a 
fourth mammalian cell comprising a recombinant expression construct encoding a 
mammalian melanocortin receptor that is the MC-4 receptor, and a fifth mammalian cell 
comprising a recombinant expression construct encoding a mammalian melanocortin 
receptor that is the MC-5 receptor, wherein each mammalian cell expresses the 
melanocortin receptor encoded by the recombinant expression construct comprising the 
cell. 
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FIG. 1A 
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FIG. IB 



TCC AOC ACC CTC TTT ATC ACC TAG TAG AAO CAC ACA OCC OTT CTG CTC 578 
Ser Ser Thr Leu Phe lie Thr Tyr Tyr Lya Hia Thr Ala Val Leu Leu 
175 100 IBS 

TGC CTC QTC ACT TTC TTT CTA OCC ATO CTG OCA CTC ATG GCO ATT CTG 626 
Cys Leu Val Thr Phe Phe Leu Ala Met Leu Ala Leu Met Ala He Leu 
190 195 200 

TAT OCC CAC ATO TTC ACQ AOA OCO TOC CAO CAC OTC CAQ OOC ATT OCC 674 
Tyr Ala Hia Met Phe Thr Arg Ala Cya Oln Hie Val Oln Oly lie Ala 
205 210 215 220 

CAO CTC CAC AAA AGO COO COG TCC ATC COC CAA OOC TTC TGC CTC AAG 722 
Oln Leu Hia Lya Arg Arg Arg Ser He Arg Oln Oly Phe Cya Leu Lye 
225 230 235 

GOT OCT OCC ACC CTT ACT ATC CTT CTG GOG ATT TTC TTC CTO TGC TOO 770 
Gly Ala Ala Thr Leu Thr He Leu Leu Gly He Phe Phe Leu Cya Trp 
240 245 250 

OOC CCC TTC TTC CTG CAT CTC TTG CTC ATC OTC CTC TOC CCT CAO CAC 818 
Gly Pro Phe Phe Leu Hia Leu Leu Leu lie Val Leu Cya Pro Gin Hia 
255 260 265 

CCC ACC TGC AOC TGC ATC TTC AAG AAC TTC AAC CTC TTC* CTC CTC CTC 866 
Pro Thr Cya Ser Cya He Phe Lya Aan Phe Aan Leu Phe Leu Leu Leu 
270 275 280 

ATC OTC CTC AOC TOC ACT OTT GAC CCC CTC ATC TAT OCT TTC COC AOC 914 
He Val Leu Ser Ser Thr Val Asp Pro Leu He Tyr Ala Phe Arg Ser 
285 290 295 300 

CAO GAG CTC COC ATG ACA CTC AAO GAG GTG CTO CTO TGC TCC TOG 959 
Oln Olu Leu Arg Met Thr Leu Lya Glu Val Leu Leu Cya Ser Trp 
305 310 315 

TGATCAOAGG GCGCTGGGCA GAGGGTOACA GTOATATCCA GTGGCCTGCA TCTOTGAOAC 1019 

CACAOGTACT CATCCCTTCC TOATCTCCAT TTOTCTAAOO OTCGACAGGA TGAGCTTTAA 1079 

AATAOAAACC CAOAOTGCCT GOGGCCAGOA OAAAOGOTAA CTOTOACTOC AGGGCTCACC 1139 

CAOOGCAOCT ACGGGAAGTG GAGGAOAGAO OOATGGGAAC TCTAGCCCTO AGCAAOGOTC 1199 

AOACCACAGG CTCCTOAAOA OCTTCACCTC TCCCCACCTA CAGGCAACTC CTGCTCAAOC 1259 

C 1260 
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FIG. 2 A 



CCCaCATOTO OCCaCCCTCA ATOGAOOGCT CTOAOAACOA CTTTTAAAAC QCAOAOAAAA 60 

AQCTCCATTC TTCCCAGACC TCAGCGCAGC CCTOaCCCAG QAAOQGAOOA OACAOAOOCC 120 

AOGACGOTCC AGAOGTGTCG AAATOTCCTO GGAACCTGAG CAOCAGCCAC GAGGGAAGAG X80 

QCAGOOAGGO AGCTGAGGAC CAGGCTTGGT TOTGAGAATC CCTGAGCCCA GOCGGTTGAT 240 

OCCAOOAOOT OTCTOOACTO GCTOOGCCAT OCCTOOOCTG ACCTOTCCAQ CCAGGOAGAG 300 

OOTOTOAOaO CAOATCTOGG GGTGCCCAGA TGOAAOOAOG CAGGCATGGG GACACCCAAG 360 

GCCCCCTOGC AGCACCATGA ACTAAGCAGG ACACCTGGAG GGGAAGAACT GTGGGGACCT 420 

GGAGGCCTCC AACOACTCCT T C CT Q CTTC C TGGACAGGAC T ATO GCT GTG CAG 473 

Met Ala Val Gin 
1 

GGA TCC GAG AGA AGA CTT CTG GGC TCC CTC AAC TCC ACC CCC ACA GCC 521 
Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Aon Ser Thr Pro Thr Ala 
5 10 IS 20 

ATC CCC CAG CTG GOG CTG GCT GCC AAC CAG ACA GGA GCC CGG TGC CTG 569 
He Pro Gin Leu Gly Leu Ala Ala Ann Gin Thr Gly Ala Arg Cys Leu 
25 30 35 

GAG GTG TCC ATC TCT GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG AGC 617 
Glu Val Ser lie Ser Asp Gly Leu Phe Leu Ser Leu Gly Leu Val Ser 
40 45 SO 

TTG GTG GAG AAC GCG CTG GTO GTG GCC ACC ATC GCC AAQ AAC CGG AAC 665 
Leu Val Glu Abu Ala Leu Val Val Ala Thr He Ala Lys Asn Arg Asn 
55 60 65 

CTG CAC TCA CCC ATO TAC TGC TTC ATC TGC TGC CTG GCC TTG TCG GAC 713 
Leu His Ser Pro Met Tyr Cye Phe He Cye Cys Leu Ala Leu Ser Asp 
70 75 80 

CTG CTG GTG AGC OGG ACQ AAC GTG CTG GAG ACQ GCC GTC ATC CTC CTG 761 
Leu Leu Val Ser Gly Thr Asn Val Leu Glu Thr Ala Val He Leu Leu 
65 90 95 100 

CTG GAG GCC GOT GCA CTG GTG GCC CGG GCT GCG GTG CTG CAG CAG CIO 809 
Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val Leu Gin Gin Leu 
105 no lis 

GAC AAT GTC ATT GAC GTG ATC ACC TGC AGC TCC ATG CTG TCC AGC CTC 857 
Asp Asn Val He Asp Val He Thr Cys Ser Ser Met Leu Ser Ser Leu 
120 125 130 

TGC TTC CTG GGC GCC ATC GCC GTG GAC CGC TAC ATC TCC ATC TTC TAC 905 
Cys Phe Xjeu Gly Ala. He Ala Val Asp Arg Tyr He Ser He Phe Tyr 
135 140 145 

GCA CTG CGC TAC CAC AGC ATC GTG ACC CTG CCG CGG GCG CCO COA GCC 953 
Ala Leu Arg Tyr His Ser He Val Thr Leu Pro Arg Ala Pro Arg Ala 
ISO 155 160 
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FIG. 2B 



ott oco acc atc too oto acc aot ore arc ttc aoc acq ctc ttc atc 
VaX Ala Ala lie Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe lie 
l«5 170 175 180 



1001 



OCC TAC TAG OAC CAC OTO OCC OTC CTO CTO TOC CTC OTO OTC TTC TTC 1049 
Ala Tyx Tyx Asp His Val Ala Val Leu Leu Cya Leu Val Val Phe Phe 
185 190 195 

CTO OCT ATO CTO OTO CTC ATO OCC OTO CTO TAC OTC CAC ATQ CTO OCC 1097 
Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val His Met Leu Ala 
200 205 210 

COO OCC TOC CAO CAC OCC CAO OGC ATC OCC COO CTC CAC AAO AGO CAO 1145 
Arg Ala Cya Gin His Ala Ola Oly lie Ala Arg Leu His Lya Arg Gin 
21S 220 22S 

COC CCO OTC CAC CAO OGC TTT OOC CTT AAA OGC OCT OTC ACC CTC ACC 1193 
Arg Pro Val Bis Oln Oly Phe Oly Leu Lya Oly Ala Val Thr Leu Thr 
230 235 240 

ATC CTO CTQ GOC ATT TTC TTC CTC TOC TOO GGC CCC TTC TTC CTO CAT 1241 
lie Leu Leu Oly He Phe Phe Leu Cys Trp Oly Pro Phe Phe Leu His 
24S 2S0 255 260 

CTC ACA CTC ATC OTC CTC TOC CCC GAG CAC CCC ACQ TOC OGC TOC ATC 1289 
Leu Thr Leu lie Val Leu Cys Pro Glu His Pro Thr Cya Gly Cya lie 
265 270 275 

TTC AAO AAC TTC AAC CTC TTT CTC GCC CTC ATC ATC TOC AAT OCC ATC 1337 
Phe Lya Asn Phe Aan Leu Phe Leu Ala Leu He lie Cys Asn Ala He 
280 285 290 

ATC GAC CCC CTC ATC TAC GCC TTC CAC AOC CAG GAG CTC COC AGO ACQ 1385 
He Asp Pro Leu He Tyr Ala Phe His Ser Gin Glu Leu Arg Arg Thr 
295 300 305 

CTC AAG GAG OTO CTO ACA TGC TCC TOO TGAGCOCGGT GCAOGCGCTT 1432 
Leu Lya Olu Val Leu Thr Cya Ser Trp 
310 315 

TAAGTGTOCT OGOCAaAOGO AGGTGGTGAT ATTOTQGTCT GGTTCCTOTG TOACCCTOGG 1492 

GAGRCCRA CCTCCCTGGT CCCCGTTTOT CAAAGAOGAT GGACTAAATO ATCTCTOAAA 15S2 

GTOTTGAAGC GCGOACCCTT CTOOGCAGGG AOGGOTCCTG CAAAACTCCA GOCAGGACTT 1612 

CTCACCAGCA OTOOTGGOAA C x633 
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FIG. 3 A 



acaacacttt atatatattt ttataaatot aaooggtaca aaggtoccat tttottacat 60 

ooatatacco totagtogto aagcctcggc ttttagtota tctqtcatca oaataacata 120 

cotoitagcc ataooaattt ctcatcaccc qccccctcca cccttcgaot ctccaatotc ibo 

cattccacac tctatatcca cgtotatoca tataoctcca catataaqtq aoaacatota 240 

OTATTTOACT TCCTCTTTCT GAOTTATTTC ACTTTOATAA TOOCCTCCAC TTCCATCCAT 300 

GTTOCTOCAA AAOACATOAC CTTATTCTTT TTOATAGCTO GGGAGTACTC CATTOTOTAT 360 

ATOTACCACA TTTCTTTATC CATTCACCCA TTOAOAACAC TTAQTTOATT CCATATCTTT 420 

GCTATTOTCA CTAOTGCTCC AATAAACATA CATOTOCAG0 CTCCTTCTAA TATACTOATT 480 

TATATTTTAT GGAGAGAGAT AOAGTTCTTA GCQAOTGTGC TOTTTATTTC TAOTGTACTT 540 

OCAACTAATA TTCTOTATAC TCCCTTTAOG TOATTCGAGA TTTAACTTAG ATCTCCAQCA 600 

AOTGCTACAA GAAGAAAAOA TCCTGAAGAA TCAATCAAGT TTCC6TGAAG TCAAGTCCAA 660 

OTAACATCCC CQCCTTAACC ACAAQCAGOA GAA ATO AAQ CAC ATT ATC AAC TCO 714 

Met Lye His lie Zle Ann Ser 
1 S 

TAT GAA AAC ATC AAC AAC ACA OCA AOA AAT AAT TCC OAC TOT CCT COT 762 
Tyr Olu Afin He Asn Asu Thr Ala Arg Aon Asa Ser Asp Cyo Pro Arg 
10 15 20 



OTO OTT TTO CCQ OAO GAG ATA TTT TTC ACA ATT TCC ATT OTT QGA GTT 
Val Val Leu Pro Glu Glu lie Pne Phe Thr He Ser He Val Gly Val 
25 30 35 



CAO OCA CCC ATO TAC TTT TTC ATC TOT AGC TTO GCC ATA TCT GAT ATO 
Gin Ala Pro Met Tyr Phe Phe He Cyu Ser Leu Ala He Ser Asp Met 
60 65 70 



AAC ATO GGC TAT CTC AAG CCA COT GGC AGT TTT GAA ACC ACA GCC GAT 
Asn Met Gly Tyr Leu Lys Pro Arg Gly Ser Phe Glu Thr Thr Ala Asp 
90 95 100 



S/tf(p 



810 



TTO GAG AAT CTO ATC GTC CTO CTO OCT OTO TTC AAG AAT AAG AAT CTC 858 
Leu Glu Asn Leu He Val Leu Leu Ala Val Phe Lys Asn Lys Asn Leu 
40 45 50 55 



906 



CTO GGC AGC CTA TAT AAG ATC TTO GAA AAT ATC CTO ATC ATA TTO AOA 954 
Leu Gly Ser Leu Tyr Lys He Leu Glu Asn He Leu He He Leu Arg 
75 80 8S 



1002 



OAC ATC ATC GAC TCC CTO TTT GTC CTC TCC CTO CTT GGC TCC ATC TTC 1050 
Asp He He Asp ser Leu Phe Val Leu Ser Leu Leu Gly Ser He Phe 
105 no us 
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FIG. 3B 



aoc era tct ara att oct aca gac cac tag atc acc atc ttc cac oca ±09b 
Ser Leu Ser Val lie Ala Ala Asp Arg Tyx He Thr He Phe His Ala 
120 125 130 135 

CTO CGG TAC CAC AOC ATC OTO ACC ATO CGC COC ACT GTQ GT3 GT3 CTT 1146 
Leu Arg Tyr Hie Ser He Val Thr Met Arg Arg Tfar Val Val Val Leu 
140 145 150 

ACQ OTC ATC TOO ACQ TTC TOC ACQ GOG ACT GGC ATC ACC ATO 0T0 ATC 1194 
Thr Val lie Trp Thr Phe Cys Thr Oly Thr Gly He Thr Met Val He 
15S 160 165 

TTC TCC CAT CAT OTO CCC ACA OTO ATC ACC TTC ACG TCG CTO TTC CCG 1242 
Phe Ser His Hie Val Pro Thr Val He Thr Phe Thr Ser Leu Phe Pro 
170 175 1B0 

CTG ATO CTO OTC TTC ATC CTO TOC CTC TAT OTC CAC ATO TTC CTO CTO 1290 
Leu Met Leu Val Phe He Leu Cyo Leu Tyr Val His Met Phe Leu Leu 
185 190 195 

OCT COA TCC CAC ACC AGO AAG ATC TCC ACC CTC CCC AGA OCC AAC ATO 1338 
Ala Arg Ser His Thr Arg Lye He Ser Thr Leu Pro Arg Ala Aen Met 
200 205 210 215 

AAA GOG OCC ATC ACA CTG ACC ATC CTO CTC GGG GTC TTC ATC TTC TOC 1386 
Lye Gly Ala He Thr Leu Thr He Leu Leu Gly Val Phe He Phe Cys 
220 225 230 

TOO OCC CCC TTT QTG CTT CAT GTC CTC TTO ATG ACA TTC TOC CCA AOT 1434 
Trp Ala Pro Phe Val Leu His Val Leu Leu Met Thr Phe Cys Pro Ser 
235 240 245 

AAC CCC TAC TOC OCC TOC TAC ATG TCT CTC TTC CAO OTO AAC GOC ATO 1482 
Asn Pro Tyr Cys Ala Cyo Tyr Met Ser Leu Phe Gin Val Asn Gly Met 
250 255 260 

TTO ATC ATG TOC AAT OCC GTC ATT OAC CCC TTC ATA TAT OCC TTC COO 1530 
Leu He Met Cys Asn Ala Val He Asp Pro Phe He Tyr Ala Phe Arg 
265 270 275 



AOC CCA GAG CTC AGO OAC OCA TTC AAA AAG ATG ATC TTC TQC AOC AGO 
Ser Pro Glu Leu Arg Asp Ala Phe Lye Lys Met He Phe Cys Ser Arg 
280 285 290 295 



1578 



TAC TOO TAGAATOOCT GATCCCTOOT TTTAGAATCC ATGOOAATAA CGTTGCCAAO 1634 
Tyr Trp 

TOCCAGAATA GTOTAACATT CCAACAAATO CCAGTOCTCC TCACTOOCCT TCCTTCCCTA 1694 

ATOGATOCAA OOATOACCCA CCAOCTAOTG TTTCTGAATA CTATGOCCAO QAAGAGTCTA 1754 

TTGTAGGGOC AACTCTATTT GTOACTOOAC AGATAAAACG TOTAGTAAAA OAAGOATAGA 1814 

ATACAAAGTA TTAGGTACAA AAOTAATTAO GTTTQCATTA CTTATOACAA AT0CATTACT 1874 

TTTOCACCAA TCTAGTAAAA CAOCAATAAA AATTCAAGGG CTTTOOGCTA AOGCAAAGAC 1934 

TTGCTTTCCT GTGOACATTA ACAAOCCAOT TCTGAOOCGG CCTTTCCAGO TGOAOOCCAT 1994 

TOCAGCCAAT TTCAGAOT ^ 2012 



SUBSTITUTE SHEET (RULE 26) 



WO 98/10068 



PCT/US97/I5565 



FIG. 4A 



OOOGCCAOAA AQTTCCTQCT TCAOAOCAQA AOATCTTCAG CAAOAACTAC AAAGAAQAAA 60 

AOATTCTGOA GAATCAATCA AOTTTCCTGT CAAOTTCCAG TAACGTTTCT OTCTTAACTG 120 

CACACAOOAA AG ATO AAA CAC ATT CTC AAT CTO TAT OAA AAC ATC AAC 166 
Met Lye His lie Leu Asn Leu Tyr Olu Asn He Asn 
15 10 

AOT ACA OCA AGA AAT AAC TCA OAC TOT CCT OCT OTO ATT TTO CCA OAA 216 
Ser Thr Ala Arg Aen Asn Ser Asp Cya Pro Ala Val lie Leu Pro Olu 
15 20 25 

GAG ATA TTT TTC ACA GTA TCC ATT GTT GOO OTT TTO GAG AAC CTG ATO 264 
Olu He Phe Phe Thr Val Ser He Val Gly Val Leu Glu Asn Leu Met 
30 35 40 

OTC CTT CTG OCT OTG GCC AAG AAT AAG AOT CTT CAO TCG CCC ATO TAC 312 
Val Leu Leu Ala Val Ala Lye Asa Lys Ser Leu Gin Ser Pro Met Tyr 
45 50 55 60 

TTT TTC ATC TGC AGO TTG OCT ATT TCC OAT ATO CTG GGO AGC CTG TAC 360 
Phe Phe lie Cys Ser Leu Ala lie Ser Asp Met Leu Gly Ser Leu Tyr 
65 70 75 

AAG ATT TTO OAA AAC GTT CTO ATC ATG TTC AAA AAC ATO GOT TAC CTC 408 
Lys He Leu Glu Asn Val Leu lie Met Phe Lye Asn Met Gly Tyr Leu 
80 85 90 

GAG CCT CGA GGC AOT TTT GAA AGC ACA OCA GAT GAT GTO GTG GAC TCC 456 
Glu Pro Arg Gly Ser Phe Glu Ser Thr Ala Asp Asp Val Val Asp Ser 
95 100 105 

CTO TTC ATC CTC TCC CTT CTC GGC TCC ATC TGC AGC CTG TCT GTG ATT 504 
Leu Phe He Leu Ser Leu Leu Gly Ser He Cys Ser Leu Ser Val He 
110 115 120 

GCC GCT GAC COC TAC ATC ACA ATC TTC CAC OCT CTO CAO TAC CAC CGC 552 
Ala Ala Asp Arg Tyr He Thr He Phe His Ala Leu Gin Tyr His Arg 
125 130 135 140 

ATC ATG ACC CCC GCA CCG TGC CCT CGT CAT CTG ACQ GTC CTC TGG GCA 600 
He Met Thr Pro Ala Pro Cya Pro Arg His Leu Thr Val Leu Trp Ala 
145 150 155 

GGC TGC ACA GGC AGT GGC ATT ACC ATC GTG ACC TTC TCC CAT CAC GTC 648 
Gly Cys Thr Gly Ser Gly He Thr He Val Thr Phe Ser His His Val 
160 165 170 

CCC ACA GTG ATC GCC TTC ACA GCO CTG TTC CCG CTG ATO CTG GCC TTC 696 
Pro Thr Val He Ala Phe Thr Ala Leu Phe Pro Leu Met Leu Ala Phe 
175 180 185 

ATC CTG TGC CTC TAC GTG CAC ATO TTC CTG CTG GCC CGC TCC CAC ACC 744 
He Leu Cys Leu Tyr Val His Met Phe Leu Leu Ala Arg Ser His Thr 
190 195 200 

AGO AGO ACC CCC TCC CTT CCC AAA GCC AAC ATG AGA GGG GCC GTC ACA 792 
Arg Arg Thr Pro Ser Leu Pro Lys Ala Asn Met Arg Gly Ala Val Thr 
205 210 215 220 
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FIG. 4B 



era act otc ctq ctc oaa gtc ttc att ttc tot too oca ccc ttt otc 840 

Leu Thr VaX Leu Leu Oly Val Phe lie Phe Cye Trp Al* Pro Phe Val 
225 230 235 

CTT CAT OTC CTC TTO ATO ACA TTC TQC CCA OCT GAC CCC TAC TOT OCC 886 
Leu His V&l Leu Leu Met Thr Phe Cye Pro Ala Asp Pro Tyr Cye Ala 
240 245 250 

TQC TAC ATO TCC CTC TTC CAO GTO AAT GOT GTO TTO ATC ATO TOT AAT 936 
Cys Tyr Met Ser Leu Phe Oln Val Asa, Oly Val Leu He Met Cye Asn 
255 260 265 

OCC ATC ATC GAC CCC TTC ATA TAT OCC TTT COO AOC CCA GAG CTC AGG 984 
Ala lie lie Asp Pro Phe lie Tyr Ala Phe Arg Ser Pro Glu Leu Arg 
270 275 280 

OTC GCA TTC AAA AAO ATO GTT ATC TOC AAC TOT TAC CAG TAGAATOATT 1033 
Val Ala Phe Lys Lys Met Val He Cys Asn Cys Tyr Gin 
265 290 295 

GGTCCCTOAT TTTAGGAGCC ACAGGGATAT ACTOTCAGOG ACAGAGTAGC GTCACAGACC 1093 
AACAACACTA GGACT 1106 
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FIG. 5 A 



GGCTGTAACT GTAGCAACCG QTOTTOOOTO OaGATGAOAA GAOACCAOAG AGAQAGAGGG 60 

TCAGAGCOAC AGOGGATQAG ACAOQCTOOT CAOAGTCTGC ACTGATTOTT aOAGACGCAA 120 

AGGAAAOTTT TTTCTATOTC TCCAACCTCC CCCTCCTCCC C CQTT T CTCT CTGGAGAAAC 180 

TAAAATCTAO ACTOOACAOC ATCCACAAGA OAAOCACCTA GAAGAAGATT TTTTTTTCCC 240 

AGCAOCTTGC TCAGGACCCT OCAOOAOCTG CAOCCOOAAC TGQTCCCQCC OATAACC 297 

ATO AAC TCT TCC TOC TOC CCD TCC TCC TCT TAT CCO ACQ CTO CCT AAC 345 
Met Aan Scr Ser Cys Cye Pro Ser Ser Ser Tyx Pro Thr Leu Pro Asn 
1 5 10 15 

CTC TCC CAG CAC CCT OCA GCC CCC TCT OCC A0C AAC COO AQT OOC ACT 393 
Leu Ser Gin Hie Pro Ala Ala Pro Ser Ala Ser Aan Arg Ser Qly Ser 
20 25 30 

GOG TTC TOC OAO CAO OTT TTC ATC AAG CCA GAG OTC TTC CTG OCA CTQ 441 
Oly Phe Cyo Glu Gin Val Phe lie Lye Pro Olu Val Phe Leu Ala Leu 
35 40 45 

GGC ATC OTC AOT CTO ATG OAA AAC ATC CTO OTO ATC CTO OCT OTG GTG 489 
Gly lie Val Ser Leu Met Olu Aan lie Leu Val lie Leu Ala Val Val 
50 55 60 

AGO AAC OOC AAC CTO CAC TCC CCC ATO TAG TTC TTC CTG CTO AOC CTO 537 
Arg Aan Gly Asn Leu His Ser Pro Met Tyr Phe Phe Leu Leu Ser Leu 
65 70 75 80 

CTO CAO OCC GAC ATO CTO OTO AOC CTO TCC AAC TCC CTO GAG ACC ATC 585 
Leu Gin Ala Aap Met Leu Val Ser Leu Ser Asn Ser Leu Glu Thr lie 
85 90 95 

ATG ATC OTO OTT ATC AAC AOC GAC TCC CTO ACC TTO GAG GAC CAA TTC 633 
Met lie Val Val lie Asn Ser Asp Ser Leu Thr Leu Olu Asp Gin Phe 
100 105 no 

ATC CAO CAC ATO GAC AAC ATC TTC GAC TCT ATG ATC TOC ATC TCC CTO 681 
lie Gin His Met Asp Asn He Phe Asp Ser Met lie Cye lie Ser Leu 
115 120 125 

OTO GCC TCC ATC TOC AAC CTC CTO OCC ATC OCC OTO OAC AGO TAC GTC 729 
Val Ala Ser lie Cya Asn Leu Leu Ala He Ala Val Asp Arg Tyr Val 
130 135 140 

ACC ATC TTC TAT OCC CTC COT TAC CAC AOC ATC ATG ACQ OTT AGO AAA 777 
Thr He Phe Tyr Ala Leu Arg Tyr His Ser He Met Thr Val Arg Lye 
1*5 150 155 160 

OCC CTC TCC TTO ATC OTO OCC ATC TOO OTC TOC TOT OOC ATC TGC OOC 825 
Ala Leu Ser Leu He Val Ala lie Trp Val Cys Cya Oly He Cye Gly 
165 170 175 

GTG ATO TTC ATC OTC TAC TCC OAO AOC AAG ATO GTC ATC OTO TOC CTC 873 
Val Met Phe He Val Tyr Ser Glu ser Lye Met Val He Val Cya Leu 
180 185 190 
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FIG. SB 



ATC ACC ATO TTC TTC OCC ATQ OTO CTC CTC ATQ OOC ACC CTQ TAC ATC 921 
He Thr Met Phe Phe Ala Met Val Leu Leu Met Qly Thr Leu Tyx He 
195 200 205 

CAC ATO TTC CTC TTC OCC AGO CTO CAC OTC CAO COC ATC OCO OCA CTQ 969 
His Met Phe Leu Phe Ala Arg Leu His Val Ola Arg He Ala Ala Leu 
210 215 220 

CCA CCT OCT QAC QGO OTA OCC CCO CAO CAO CAC TCO TOC ATO AAO OOO 1017 
Pro Pro Ala Asp Gly Val Ala Pro Oln Oln His Ser Cys Met Lys Oly 
225 230 235 240 

OCC OTC ACC ATC ACC ATC CTQ CTQ OOO OTT TTC ATC TTC TOC TOO OCO 1065 
Ala Val Thr lie Thr He Leu Leu Oly Val Phe He Phe Cys Trp Ala 
245 250 255 

CCT TTC TTC CTC CAC CTO OTC CTC ATC ATC ACC TOC CCC ACC AAC CCC 1113 
Pro Phe Phe Leu His Leu Val Leu He He Thr Cys Pro Thr Asn Pro 
260 265 270 

TAC TOC ATC TOC TAC ACQ OCQ CAC TTC AAC ACC TAC CTO OTT CTC ATC 1161 
Tyr Cys He Cys Tyr Thr Ala His Phe Asn Thr Tyr Leu Val Leu He 
275 280 265 

ATO TOC AAC TCT OTC ATC OAC CCC CTC ATC TAC OCC TTC COC AOC CTO 1209 
Met Cys Asn Ser Val He Asp Pro Leu lie Tyr Ala Phe Arg Ser Leu 
290 295 300 

GAG CTQ COA AAC ACC TTC AAO GAG ATT CTC TOC GOT TOC AAT OOC ATO 1257 
Olu Leu Arg Asn Thr Phe Lys Olu He Leu Cys Oly Cys Asn Qly Met 
305 310 315 320 

AAC OTO OOC TAGGAACCCC GQAGGAOGTG TTCCACOOCT AGCCAAGAOA 1306 
Asn Val Oly 



GAAAAGCAAT GCTCAGGTGA GACACAOAAO GO 1338 
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FIG. 6 A 



AGCTTCCQAG AOGCAOCCGA TOTOAGCATO TQCQCACAGA TTCOTCTCCC AATOGCATOG 60 

CAOCTTCAAO GAAAATTATT TTGAACAGAC TTOAATOCAT AAGATTAAAO TTAAAQCAGA 120 

AGTGAGAACA AOAAAGCAAA GAGCAOACTC TTTCAACTGA GAATOAATAT TTTOAAGCCC 160 

AAQATTTTAA AOTGATGATG ATTAOAGTCG TACCTAAAAO AOACTAAAAA CTCCATOTCA 240 

AGCTCTGGAC TTOTOACATT TACTCACAOC AOOCATQOCA ATTTTAOCCT CACAACTTTC 300 

AGACAGATAA AOACTTQGAG QAAATAACTO AGACGACTCC CTGACCCAGG AOGTTAAATC 360 

AATTCAQQOG GACACTOGAA TTCTCCTGCC A0C ATO OTO AAC TCC ACC CAC COT 414 

Met Val Aon Ser Thr His Arg 

1 5 

aaa atg cac act tct cto cac ctc too aac coc aoc aot tac aoa era 462 

Gly Met Hio Thr Ser Leu His Leu Trp Asn Arg Ser Ser Tyr Arg Leu 
10 IS 20 

CAC AGC AAT GCC AOT GAG TCC CTT GGA AAA GOC TAC TCT GAT GGA GGG 510 
His Ser Asn Ala Ser Glu Ser Leu Gly Lye Gly Tyr Ser Asp Gly Gly 
25 * 30 35 

TOC TAC GAG CAA CTT TTT GTC TCT CCT GAG GTG TTT GTG ACT CTO GOT 558 
Cys Tyr Glu Gin Leu Phe Val Ser Pro Glu Val Phe Val Thr Leu Gly 
40 45 50 55 

GTG ATC AGC TTQ TTG GAG AAT ATC TTA GTG ATT GTG OCA ATA GCC AAG 606 
Val lie Ser Leu Leu Glu Asn He Leu Val He Val Ala He Ala Lys 
60 65 70 

AAC AAG AAT CTO CAT TCA OCC ATG TAC TTT TTC ATC TOC AGC TTG OCT 654 
Asn Lys Asn Leu His Ser Pro Met Tyr Phe Phe He Cys Ser Leu Ala 
75 80 85 

GTG OCT GAT ATG CTO GTG AGC GTT TCA AAT GGA TCA GAA ACC ATT ATC 702 
Val Ala Asp Met Leu Val Ser Val Ser Asn Gly Ser Glu Thr He He 
90 95 100 

ATC ACC CTA TTA AAC AOT ACA GAT ACQ GAT OCA GAG AOT TTC ACA GTG 750 
He Thr Leu Leu Asn Ser Thr Asp Thr Asp Ala Gin Ser Phe Thr Val 
105 110 115 

AAT ATT GAT AAT GTC ATT GAC TOO GTG ATC TOT AGC TCC TTG CTT GCA 798 
Asn lie Asp Asn Val He Asp Ser Val He Cys Ser Ser Leu Leu Ala 
120 125 130 135 

TCC ATT TOC AGC CTG CTT TCA ATT GCA GTG GAC AGO TAC TTT ACT ATC 646 
Ser He Cys Ser Leu Leu Ser He Ala Val Asp Arg Tyr Phe Thr He 
140 145 150 

TTC TAT OCT CTC CAG TAC CAT AAC ATT ATG ACA GTT AAG COG GTT GGG 894 
Phe Tyr Ala Leu Gin Tyr His Asn He Met Thr Val Lys Arg Val Gly 
155 160 165 

ATC AGC ATA AOT TOT ATC TOG GCA OCT TGC ACQ GTT TCA GOC ATT TTG 942 
He Ser He Ser Cys He Trp Ala Ala Cys Thr Val Ser Gly He Leu 
170 175 160 
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FIG. 6B 



TTC ATC ATT TAC TCA GAT AQT AOT OCT GTC ATC ATC T3C CTC ATC ACC 990 
Phe lie lie Tyr Ser Asp Ser Ser Ala Val lie He Cys Leu He Thr 
185 190 195 

ATQ TTC TTC ACC ATO CTO OCT CTC ATQ OCT TCT CTC TAT OTC CAC CTG 1038 
Met Phe Phe Thr Met Leu Ala Leu Met Ala Ser Leu Tyr Val His Leu 
200 205 210 215 

TTC CTO ATO QCC AGO CTT CAC ATT AAQ AGO ATT OCT OTC CTC CCC OOC 1086 
Phe Leu Met Ala Arg Leu H1b He Lys Arg He Ala Val Leu Pro Oly 
220 225 230 

ACT GOT QCC ATC COC CAA GGT GCC AAT ATO AAG OGA OCO ATT ACC TTO 1134 
Thr Oly Ala He Arg Gin Oly Ala Aan Met Lys Oly Ala He Thr Leu 
235 240 245 

ACC ATC CTG ATT GGC GTC TTT GTT GTC TGC TOO GCC CCA TTC TTC CTC 1182 
Thr He Leu He Oly Val Phe Val Val Cys Trp Ala Pro Phe Phe Leu 
250 255 260 

CAC TTA ATA TTC TAC ATC TCT TGT CCT CAG AAT CCA TAT TOT GTG TGC 1230 
His Leu He Phe Tyr He Ser Cye Pro Gin Asn Pro Tyr Cys Val Cys 
265 270 275 

TTC ATC TCT CAC TTT AAC TTO TAT CTC ATA CTG ATC ATG TGT AAT TCA 1278 
Phe Met Ser His Phe Asn Leu Tyr Leu He Leu He Met Cys Asn Ser 
280 285 290 295 

ATC ATC GAT CCT CTG ATT TAT GCA CTC CGG AGT CAA GAA CTO AGO AAA 1326 
He He Asp Pro Leu He Tyr Ala Leu Arg Ser Gin Glu Leu Arg Lys 
300 305 310 

ACC TTC AAA GAG ATC ATC TCT TCC TAT CCC CTG OGA GOC CTT TGT GAC 1374 
Thr Phe Lys Glu He He Ser Ser Tyr Pro Leu Oly Gly Leu Cys Asp 
315 320 325 

TTO TCT AGC AGA TAT TAAATGGGGA CAGAGCACGC AATATAGGAA CATCCATAAG 1429 
Leu Ser Ser Arg Tyr 
330 

AGACTTTTTC ACTCTTACCC TACCTGAATA TTCTACTTCT GCAACAGCTT TCTCTrCCO T 1489 

GTAGGGTACT GGTTGAGATA TCCATTOTGT AAATTTAAGC CTATGATTTT TAATOAGAAA 1549 

AAATOCCCAG TCTCTCTATT ATTTCCAATC TCATGCTACT TTTTTGGCCA TAAAATATGA 1609 

ATCTATOTTA TAGGTTOTAG GCACTGTGGA TTTACAAAAA GAAAAGTCCT TATTAAAAGC 1669 

77 1671 
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FIG. 7 A 



ATO AAC TCC TCC TCC ACC CTO ACT OTA TTQ AAT CTT ACC CTQ AAC OCC 46 
Met Asn Ser Ser Ser Thr Leu Thr Val Leu Aan Leu Thr Leu Aan Ala 
IS 10 15 

TCA GAO OAT OOC ATT TTA GQA TCA AAT OTC AAO AAC AAG TCT TTG OCC 96 
Ser Olu Asp Qly lie Leu Oly Ser Asn Val Lya Aan Lya Ser Leu Ala 
20 25 30 

TOT GAA QAA ATG GGC ATT OCC GTO GAO OTO TTC CTO ACC CTG GOT CTC 144 
Cye Glu Olu Met Oly lie Ala Val Olu Val Phe Leu thr Leu Qly Leu 
35 40 45 

OTC AOC CTC TTA OAG AAC ATC CTO GTC ATT OQO OCC ATA OTA AAO AAC 192 
Val Ser Leu Leu Olu Aan lie Leu Val He Oly Ala He Val Lya Aan 
50 55 60 

AAA AAC CTO CAC TCA CCC ATO TAC TTC TTT OTG GGC AOC TTA GCC GTO 240 
Lye Aan Leu His Ser Pro Met Tyr Phe Phe Val Gly Ser Leu Ala Val 
65 70 75 60 

OCC GAC ATG CTO GTO AOC ATG TCC AAT OCC TOO GAO ACT GTC ACC ATA 286 
Ala Asp Met Leu Val Ser Met Ser Aan Ala Trp Olu Thr Val Thr He 
85 90 95 

TAC TTG CTA AAT AAT AAA CAC CTO GTG ATA GCC GAC ACC TTT GTO CGA 336 
Tyr Leu Leu Aan Asn Lya Hia Leu Val He Ala Asp Thr Phe Val Arg 
100 105 110 

CAC ATC OAC AAC OTO TTC OAC TCC ATO ATC TOC ATC TCT GTG GTG GCC 384 
Hia He Asp Aan Val Phe Aap Ser Met He Cye He Ser Val Val Ala 
115 120 125 

TCG ATG TOC AGT TTG CTO GCC ATT GCG GTG GAT AGO TAC ATC ACC ATC 432 
Ser Met Cye Ser Leu Leu Ala He Ala Val Aap Arg Tyr He Thr He 
130 135 140 

TTC TAT OCC TTG CGC TAC CAC CAC ATC ATO ACC GCG AGO COC TCG GGG 480 
Phe Tyr Ala Leu Arg Tyr Hia His He Met Thr Ala Arg Arg Ser Gly 
145 150 155 160 

GTO ATC ATC GCC TOC ATT TOO ACC TTC TGC ATA AGC TGC GGC ATT GTT 528 
Val He He Ala Cya He Trp Thr Phe Cys He Ser Cya Oly He Val 
165 170 175 

TTC ATC ATC TAC TAT GAO TCC AAO TAT OTG ATC ATT TGC CTC ATC TCC 576 
Phe He He Tyr Tyr Olu Ser Lya Tyr Val He He Cya Leu He Ser 
180 185 190 

ATO TTC TTC ACC ATO CTO TTC TTC ATO GTO TCT CTO TAT ATA CAC ATO 624 
Met Phe Phe Thr Met Leu Phe Phe Met Val Ser Leu Tyr He His Met 
195 200 205 

TTC CTC CTO GCC CGG AAC CAT OTC AAO CGG ATA OCA OCT TCC CCC AGA 672 
Phe Leu Leu Ala Arg Aan His Val Lys Arg He Ala Ala Ser Pro Arg 
210 215 220 

TAC AAC TCC GTG AGO CAA AGG ACC AGC ATG AAG GGG OCT ATT ACC CTC 720 
Tyr Aan Ser Val Arg Gin Arg Thr Ser Met Lye Gly Ala He Thr Leu 
225 230 235 240 
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FIG. 7B 



ACC ATG CTA CTG GOG ATT TTC ATT OTC TGC TOO TCT CCC TTC TTT CTT 768 
Thr Met Leu Leu oly lie Phe lie Val Cya Trp Ser Pro Phe Phe Leu 
245 250 255 

CAC CTT ATC TTA ATO ATC TCC TQC CCT CAG AAC OTC TAG TOC TCT TOC 816 
Hie Leu He Leu Met lie Ser Cya Pro Oln Aan Val Tyr CyB Ser Cya 
260 265 270 

TTT ATG TCT TAG TTC AAC ATG TAC CTT ATA CTC ATC ATG TQC AAC TCC 664 
Phe Met Ser Tyr Phe Asn Met Tyr Leu He Leu He Met Cya Aan Ser 
275 280 285 

GTG ATC GAT CCT CTC ATC TAC GCC CTC CGC AGC CAA GAG ATG CGG AGG 912 
Val He Asp Pro Leu He Tyr Ala Leu Arg Ser Gin Glu Met Arg Arg 
290 295 300 

ACC TTT AAG GAG ATC GTC TGT TGT CAC GGA TTC CGG CGA CCT TGT AGG 960 
Thr Phe Lya Glu He Val Cya Cya His Gly Phe Arg Arg Pro Cya Arg 
305 310 315 320 

CTC CTT GGC GGG TAT TAA g78 
Leu Leu Gly Gly Tyr * 
325 
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FIG. 8 
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